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Invention Title: 

Method of producing plants having enhanced transpiration efficiency and plants 
produced therefrom I 

The invention is described in the following statement: 
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FIELD OF THE INVENTION 

The present invention relates to the field of plant breeding and the production of 
genetically engineered plants. More specifically, the invention described herein 
provides genes that are capable of enhancing the transpiration efficiency of a plant 
5 when expressed therein. These genes are particularly useful for the production of 
plants having enhanced transpiration efficiency, by both traditioned plant breeding 
and genetic engineering approaches. The invention further extends to plants 
produced by the methods described herein. 

10 BACKGROUND TO THE INVENTION 
1. General 

This specification contains nucleotide and amino acid sequence information 
prepared using Patentin Version 3,1, presented herein after the claims. Each 
nucleotide sequence is identified in the sequence HstLng by the numeric indicator 
<210> followed by the sequence identifier (e.g. <210>1, <210>2, etc). The length 

15 and type of sequence (DNA, protein (PRT), etc), and source orgemism for each 
nucleotide sequence, are indicated by information provided in the nimieric indicator 
fields <211>, <212> and <213>, respectively. Nucleotide sequences referred to in 
the specification are defined by the term "SEQ ID NO:", followed by the sequence 
identifier (eg. SEQ ID NO: 1 refers to the sequence in the sequence listing designated 

20 as <400>1). 

The designation of nucleotide residues referred to herein are those recommended by 
the lUPAC-IUB Biochemical Nomenclature Commission, wherein A represents 
Adenine, C represents Cytosine, G represents Guanine, T represents thjonine, Y 

25 represents a pyrimidine residue, R represents a purine residue, M repriasents 
Adenine or Cytosine, K represents Guanine or Thymine, S represents Guanine or 
Cytosine, W represents Adenine or Thymine, H represents a nucleotide other than 
Guanine, B represents a nucleotide other than Adenine, V represents a nucleotide 
other than Thymine, D represents a nucleotide other than Cytosine and N represents 

30 any nucleotide residue. 
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As used herein the term "derived from" shall be taken to indicate that a specified 
integer is obtained kovoi a particular source albeit not necessarily directly from that 
source. 

5 Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated step or element or integer or group of steps or 
elements or integers but not the exclusion of any other step or element or integer or 
group of elements or integers. 

10 

Those skilled in the art will appreciate that the invention described herein is 
susceptible to variations and modifications other than those specifically described. It 
is to be understood that the invention includes all such variations and modifications. 
The invention also includes ail of the steps, features, compositions and compounds 
15 referred to or indicated in this specification, individually or collectively, and any 
and all combinations or any two or more of said steps or features. 

The present invention is not to be limited in scope by the specific embodiments 
described herein, which are intended for the purposes of exempHfication only. 
20 Functionally equivalent products, compositions and methods are clearly within the 
scope of the invention, as described herein. 

2, Description of the related art 

It is well known that virtually all plants require a certain quantity of water for proper 
growth and development, because CO2 fixation and photosynthate assimilation by 
plants cost water, A significant quantity of water absorbed by plants from the soil 
25 returns to the atmosphere via plant transpiration. 

Transpiration efficiency is a measiure of the amount of dry matter produced by a 
plant per unit of water transpired, or, in other words, carbon gain relative to water 
lost through transpiration. 

30 

For plants having low transpiration efficiency, or when water is in short supply, the 
loss of water through transpiration can limit key metabolic processes associated with 
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plant growth and development For exeanple, during drought, or when plants having 
low transpiration efficiency are grown in arid and semi-arid environments, plant 
productivity as determined by dry matter production or photos5mthetic rate, is 
considerably reduced. Accordingly, the production of plants having enhanced water 
5 use efficiency or transpiration efficiency is highly desirable for their adaptation to 
arid or semi-arid conditions, or to enhance their drought resistance. 

The enhancement of water use efficiency or transpiration efficiency by plants is also 
highly desirable in consideration of global climatic change and increasing pressiure 

10 on world water resources. The inefficient utilization of agricultural water is known 
to impact adversely upon the supply of navigable water, potable water, emd water for 
industrial or recreational use. Accordingly, the production of plants having 
enhanced transpiration efficiency is highly desirable for reducing the pressure on 
these water resom-ces. It is also desirable for increasing plant productivity under 

15 well-watered conditions. 

By enhancing transpiration efficiency, carbon gain rates are enhanced per xmit of 
water transpired, thereby stimulating plant growth under well-watered conditions, 
or alternatively, under mild or severe drought conditions. This is achieved by 
20 enhancing carbon gain more than transpiration rate, or by reducing the amount of 
water lost at any particular rate of carbon fixation. Those skilled in the art also 
consider that for a given growth rate plemts having enhanced transpiration efficiency 
dry out soils more slowly, and use less water, than less efticient near-isogenic plants. 

25 Several chemical as well as environmental pre-treatments have been described for 
enhancing the ability of plant seedlings to survive drought, either by reducing 
transpiration or by reducing the amovmt of water that is actually lost to the 
atmosphere. 

30 Known environmental treatments largely involve the use of physical barriers. Whilst 
placing a physical barrier over plant stomata is known to reduce water loss via 
transpiration, the procedure is not always desirable or practicable for field-grown 
crops. For example, physical barriers over plant stomata may inhibit certain gas- 
exchange processes of the plant. It is more desirable to enhance actual transpiration 
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efficiency or water use efficiency of the plant through manipulation of intrinsic plant 
function. 

Chemical agents are typically the so-called "anti-transpirant" or "anti-desiccant" 
5 agents, both of which are applied to the leaves. Anti-transpirants are typically films 
or metabolic anti-transpirants. 

These products form a film on leaves, thereby either blocking stomatal pores, or 
coating leaf epidermal cells with a water-proof fQm. T3^ical film anti-transpirants 

10 include waxes, wax-oil emulsions, higher alcohols, silicones, plastics, latexes euid 
resins. For example, Elmore, United States Patent No. 4,645,682 disclosed an anti- 
transpirant consisting of an aqueous paste wax; Cushman et al., United States Patent 
Nos. 3,791,839 and 3,847,641 also disclosed wax emulsions for controlling 
transpiration in plants; and Petrucco et al., United States Patent No. 3,826,671, 

15 disclosed a polymer composition said to be effective for controlling transpiration in 
plants. 

MetaboUc anti-transpirants generally close stomata, thereby reducing the rate of 
transpiration. Ts^ical metabolic anti-transpirants include succinic acids, 
- 20 phenylmercuric acetate, hydroxystilfonates, the herbicide atrazine, sodium azide, 
and phenylhydrazones, as well as ccurbon cyanide. 

Compounds having plant growth regulator activity have also been shown to be 
useful for reducing transpiration. For example, Bliesner e^ al.. United States Patent 

25 No. 4,671,816, disclosed an acetylene compound, said to possess utility for 
regulating plant growth, whilst Kuznetsov et al. (Russian Patent No. SU 1,282,492; 
Russian Patent Application No. SU 1,253,559-Al), and Smimov et al (Russian Patent 
No. SU 1,098,934) disclosed the use of derivatives of 2-methyl-5- 
hydroxyben imidazole, and the chloride or bromide salts thereof, as antL-treinspirant 

30 growth regulators. Vichnevetskaia (USSN 5,589,437 issued December 31, 1996) also 
describe hydroxybenzimidazole derivatives for enhancing the drought resistance of 
plants by reducing transpiration, however have the advantage of being applicable to 
plant seed or roots. Schulz et al., United States Patent No. 4,943,315, also disclosed 
formulations comprising an acetylene and a phenylbenzylurea compoimd, for 
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reducing treuxspiration in pleints and/or for avoiding impairment to pleaits caused by 
heat €ind dry conditions. Abscisic acid has also been shown to reduce or suppress 
transpiration in plants (eg. Helv, Chim. Acta, 71, 931, 1988; /. Org. Chem., 54, 681, 
1989; and Japanese Patent Publication No. 184,966/1991). 

5 

Metabolic anti-transpirants are costly to produce and often exhibit ph5^otoxic effects 
or inhibit plant growth KozlowskL (1979), In: Tree Growth and Environmental 
Stresses (Univ. of Washington Press, Seattle and London), and are not practically 
used. 

10 

Recent studies have examined alternative methods for enhancing transpiration 
efficiency, particularly breeding approaches to select lines that grow more efficiently 
under mild drought conditions. Carbon isotope discrimination has been used to 
identify Arabidopsis ecotypes with contrasted transpiration efficiencies (Masle et 

15 a/.. In: Stable isotopes and plant carbon-water relations, Acad* Press, Physiol. Ser., 
pp371-386, 1993) and to eissist conventional breeding of new plant varieties in a 
niunber of species (Hall et al.. Plant Breeding Revieivs 4, 81-113, 1994) including rice 
(Farquhar et al,, In: Breaking the Yield Barrier, ed KG Cassman, IRRI, 95, 101) and 
most recently wheat (Rebetzke et al. Crop Science 42:739-745, 2002), 

20 No single gene has been identified as being capable of enhancing transpiration 
efficiency when expressed in planta. Transpiration efficiency may well be 
multigenic. As a consequence, the genes and signalling pathways that regulate the 
photosynthetic and/or stomatal components of the transpiration efficiency 
mechanism in plants have not been identified or characterized. 

25 

Moreover, notwithstanding that the effect of down-regulatiag expression of the 
Rubisco gene, or mutation in genes involved in abscisic acid (eg. aba, abi], are 
known to modify transpiration efficiency to some extent through stomatal closure, 
the consequence of such modifications is not totaUy specific, resulting in pleiotropic 
30 effects. 



Arabidopsis thaliana ecotype Landsberg erecta (L-er) is one of the most popular 
ecotypes and is used widely for both molecular and genetic studies. It harbors the 
erecta (er) mutation, which confers a compact inflorescence, blunt fruits, and short 



petioles. There are a number of er mutant alleles. Phenotypic characterization of the 
mutant alleles siiggests a role for the wild type ER gene in regulating plant 
morphogenesis, peurticularly the shapes of organs that originate from the shoot apical 
meristem. Torii et al., The Plant Cell 8, 735, 1996, showed that the ER gene encodes 
5 a putative receptor protein kinase comprising a cytoplasmic protein kinase catal5rtic 
domain, a transmembreme region, and an extracellular domain consisting of leucine- 
rich repeats, which are thought to interact with other macromolecules. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figiure la is a graphical representation showing the CO2 assimilation rates (|amol C 
m^ s'^) of several genotypes of A thaliana. The genotypes of plants are indicated on 
the X-axis, and CO2 assimilation rates indicated on the ordinate. Col indicates a 
genetic background of the ecotype Columbia. Ld indicates a genetic background of 
the ecotype Landsberg. Plants expressing wild type ERECTA alleles were either in a 

15 Col (Col4-£fl] or Ld (Ld-£7?) background. Plants that were homozygous for a mutant 
er allele were either in a Col background [Col-erlOS or Col-er2) or a Ld background 
(Ld-eri). Plants designated as Fl (Col-£R x Ld-er) were heterozygous ER/erl. Data 
indicate that, in a Col background, the erl05 mutation leads to reduced COg 
assimilation rate, whilst the erl mutation enhances COg assimilation rate in a Ld 

20 background. 

Figure lb is a graphical representation showing the stomatal conductance (mol HjO 
m^ s'^) of several genotypes of A thaliana. The genotypes of plants are indicated on 
the x-axis and are the same as .described in the legend to Figure la. Stomatal 
25 conductances are indicated on the ordinate. Data indicate that, in a Col background, 
the er2 mutation significantly enhances stomatal conductance, whilst the erl 
mutation significantly enhances stomatal conductance in a Ld backgroimd. 

Figure Ic is a graphical representation showing the transpiration efficiency of (mmol 
30 C mol HgO"^) of several genotypes of A thaliana, as determined by the ratio of CO2 
assimilation rate to stomatal conductance. The genotypes of plants are indicated on 
the X-axis and are the same as described in the legend to Figure la. Transpiration 
efficiency is indicated on the ordinate. Data indicate that transpiration efficiency is 
enhanced in plants expressing a wild type ER allele relative to a mutant er allele, in 



both Ld and Col backgrounds. The lowest transpiration efficiency was observed for 
plants that are homozygous for the erl05 allele (ie. CoVerlOS), consistent with the 
fact that this allele disrupts ERECTA expression. From the data in Figures la-lc, it is 
apparent that the lower transpiration efficiency of plants expressing the erl05 allele 
5 is largely due to a reduced CO2 fixation rate, whereas for both the er2 and erl alleles, 
reduced transpiration efficiency is largely due to enhanced stomatal conductance. 
The transpiration efficiency of the Fl heterozygote plant was intermediate between 
the transpiration efficiencies of its parents, suggesting codominance of these alleles. 
The Fl, however, had a transpiration efficiency closer to that of the pollen donor 
10 p€urent« Ld-erJ . 

Figure 2a is a graphical representation showing the stomatal densities (Number of 
stomata mm'^ leaf) for several genotypes of A thaliana in three independent 
experiments. The genetic backgroimds of plants are indicated on the x-axis (Col, 

15 Columbia; Ld, Landsberg), and stomatal densities are indicated on the ordinate. 
Plant genotypes are indicated at the top of each bar, as follows: plants expressing 
wild type ERECTA alleles in a Col backgroimd were Col4Efl or CollER (hatched 
bars); plants expressing wild type ERECTA alleles in a Ld backgroimd were ER (open 
bars); plants expressing mutant erecta alleles in a Col background were either erl 05 

20 or er2 (stippled boxes); and plants expressing the mutant erl allele in a Ld 
backgroimd were erl (filled boxes). Columns designated a,b are data from two 
experiments where plants were grown in soil in the absence of fertiLUser. The set of 
columns at the right of the figme are from a third experiment where the same plants 
were grown in soil comprising fertiliser. Data indicate that, in a Col background, the 

25 erl 05 mutation and er2 mutation enhances stomatal density, which in part accounts 
for the enhanced stomatal conductances and reduced transpiration efficiencies of 
plants expressing these alleles (Figures lb and Ic). The general effects of these 
alleles is not dependent on the nutrient status of the soil. In contrast, the erl allele 
enhanced stomatal density of Ld plants in only one case when, fertiliser was absent, 

30 suggesting that enhanced stomatal aperture may account for the enhanced stomatal 
conductances and reduced transpiration efficiencies of Ld^erl plants (Figures lb, 
Ic). 
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Figure 2b is a graphical representation showing the epidermal cell size (surface area, 
(jun^) for several genotypes of A thaliana in three independent experiments. The 
genetic backgrounds and genotypes of plants are indicated on the x-axis and at the 
tops of each column, respectively, as in the legend to Figure 2a. The ordinate 
5 indicates epidermal cell size. Col\unns designated a,b are data from two 
experiments where plants were grown in soil in the absence of fertOiser. The set of 
colimms at the right of the figure are from a third experiment where the same plants 
were grown in soil comprising fertiliser. Data indicate that, in a Col backgroimd, the 
erl05 mutation and er2 mutation significantly reduce epidermal cell size ie increase 

10 the number of epidermal cells per unit leaf area. This reveals that the ER gene has 
effects on leaf histogenesis which, beyond their consequences on stomatal densities, 
may also directiy affect leaf capacity for photosynthesis and therefore transpiration 
efficiency, (Figures lb and Ic). The general effects of these alleles is not dependent 
on the nutrient status of the soil. In contrast, the erl allele reduced epidermal cell 

15 size of Ld plants in only one case when fertiliser was absent, suggesting that 
enhanced stomatal aperture accounts for the enhanced stomatal conductances and 
reduced transpiration efficiencies of Ld-er2 plants (Figures lb, Ic). 

Figure 2c is a graphical representation showing the stomatal index for several 
20 genotypes of A thaliana in three independent experiments. The genetic 
backgroimds and genotypes of plants are indicated on the x-axis and at the tops of 
each column, respectively, 6is in the legend to Figure 2a. The ordinate indicates 
stomatal index, as determined bom the ratio of stomatal density to epidermal cell 
density. Colvunns designated a,b are data firom two experiments where plants were 
25 grown in soil in the absence of fertiliser. The set pf columns at the right of jhe figure 
are from a third experiment where tiie same plants were grown in soil comprising 
fertiliser. Data indicate that the er mutations tested do not significantiy modify 
stomatal index, because increases in stomatal density are correlated to increases in 
epidermal cell numbers in the mutant plants. Accordingly, the BR gene does not 
30 appear to directiy modify stomatal development per se. 

SUMMARY OF THE INVENTION 

In work leading up to tiie present invention, the inventors sought to elucidate the 
specific genetic determinants of plant transpiration efficiency. In plants, the 
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development of molecvJar genetic markers, such as, for example, genetic markers 
that map to a region of the genome of a crop plant, such as, for example, a region of 
the rice genome, medze genome, barley genome, sorghum genome, or wheat genome, 
or a region of the tomato genome or of any Brassicaceae , assists in the production of 
5 plants having enhanced tremspiration efficiency (Edwards et al., Genetics 116, 113 - 
125, 1987; Paterson et al.. Nature 335, 721-726, 1988). 

The present inventors identified a locus that is Linked to the genetic variation in 
transpiration efficiency in plants. To elucidate a locus associated with the 

10 tremspiration efficiency of plants, the inventors established experimental conditions 
and sampling procedures to determine the contribution to total transpiration 
efficiency of the factors influencing this phenotype, and, more particularly, the 
genetic contribution to the total variation in transpiration efficiency. Factors 
influencing transpiration efficiency include, for example, genotype of the plant, 

15 environment (eg. temperature, Ught, hiunidity, boundary layer aroxmd the leaves, 
root growth conditions), development (eg. age and/or stage and/or posture of plants 
that modifies gas exchange and/or carbon metabotism), and seed-specffic factors 
(Masle et al. 1993, op. cit). The screens developed by the inventors were also used to 
sinvey mutant and wild type populations for variations in transpiration efficiency 

20 and to identify ecot3^es having contrasting transpiration efficiencies including the 
parental lines that had been used by Lister and Dean (1993) . The transpiration 
efficiencies of the members of Lister and Dean's (1993) Recombinant Inbred Line 
(RIL) mapping population were then determined, and linkage analyses were 
performed against genetic markers to determine the chromosome regions that are 

25 Hnked to genetic variation in transpiration efficiency, thereby identifying a locus 
conditioning transpiration efficiency. 

In the exemplified embodiment of the invention, there is provided a locus associated 
witii transpiration efficiency of A thaliana, said locus defined as the ERECTA locus 
30 on A. thaliana chromosome 2. The present invention clearly extends to homologs of 
the A thaliana ERECTA locus firom other plant species, identified using the methods 
described herein. 
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Accordingly, one aspect of the invention provides a locus associated with the genetic 
variation in transpiration efficiency of a plant, wherein sedd locus comprises a 
nucleotide sequence linked genetically to the A thaliana ERECTA locus, 

5 As used herein, the terms " genetically linked" and "map to" shall be taken to refer to 
a sufficient genetic proximity between a linked nucleic acid comprising a gene, 
allele, marker or other nucleotide sequence and nucleic acid comprising all or part of 
the A thaliana ERECTA locus or all or part of a homolog of the A thaliana ERECTA 
locus from em.other plant species, to permit said linked nucleic acid to be useful for 

10 determining the presence of a particular allele of sedd A thaliana ERECTA locus or 
said homolog. Those skilled in the art will be aware that for such linked nucleic 
acid to be used in this manner, it must be sufficiently close to said locus not to be in 
linkage disequilibriiun or to have a high recombination frequency between said 
linked nucleic acid and said locus. Preferably, the linked nucleic acid and the locus 

15 are less than about 25cM apart, more preferably less than about lOcM apart, even 
more preferably less than about 5cM apart, still even more preferably less than about 
3cM apart and still even more preferably less than about IcM apart. 

Preferably, all or part of the locus of the invention is provided as recombinant or 
20 isolated nucleic acid, such as, for example, in the form of a gene construct (eg, a 
recombinant plasmid or cosmid], to faciUtate germplasm screening. 

The ERECTA locus or a gene that is linked to the ERECTA locus is particularly useful 
in a breeding progremi, to predict the transpiration efficiency of a pleint, or 

25 altematively, as a selective breeding inarker to select plants having enhanced 
tremispiration efficiency. Once mapped, marker-assisted selection (MAS) is used to 
introduce the ERECTA locus or markers linked thereto into a wide variety of 
populations. MAS has the advemtage of reducing the breeding population size 
required, and the need for continuous recurrent testing of progeny, and the time 

30 required to develop a superior line. 



Accordingly, a second aspect of the invention provides a method of selecting a plant 
having enhanced tremspiration efficiency, comprising: 
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(a) identifyirLg a locus associated with genetic vaiiatibn in transpiration 
efficiency in a plant; and 

(b) selecting a plant that comprises or expresses a gene that maps to the locus. 

5 Preferably, this aspect of the invention provides a method of selecting a plant having 
enhanced transpiration efficiency, comprising: 

(a) identifying a locus on the Arabidopsis chromosome 2 (46-50.7 cM) associated 
with genetic variation in transpiration efficiency in a plant; and 
(b) selecting a plant that comprises or expresses a gene that maps to the locus. 

10 

An alternative embodiment provides a method of selecting a plant having enhanced 
transpiration efficiency, comprising selecting a plant that comprises or expresses a 
functionally equivalent homolog of a protein-encoding region of the ERECTA gene of 
A thaliana. 

15 

As exemplified herein, the inventors also identffied specffic geiies or alleles that are 
linked to the ERECTA locus and determine the transpiration efficiency of a plant. 
More partictilarly, the transpiration efficiencies of near-isogenic lines, wherein each 
line carries a mutation within the ERECTA locus were determined, thereby providing 

20 the genetic contribution of genes or alleles at the ERECTA locus to transpiration 
efficiency. This analysis allowed the inventors to assess the genetic contribution of 
particular alleles to transpiration efficiency, thereby determining alleUc variants that 
are Unked to a particular transpiration efficiency. Thus, the elucidation of the 
ERECTA locus for transpiration efficiency in plants faciUtated the fine mapping and 

25 determination of allehc variants that modiilate transpiration efficiency. 

Accordingly, a third aspect of the invention provides a method of identifying a gene 
that determines the treuispiration efficiency of a plant comprising: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 
30 in a plant; 

(b) identifying a gene or allele that is linked to said locus, wherein said gene or 
allele is a candidate gene or allele for determining the transpiration efficiency 
of a plant; and 
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(c) determining the transpiration efficiencies of a panel of plants, wherein not all 
members of said panel comprise sedd gene or allele, and wherein variation in 
transpiration efficiency between the members of said panel indicates that sedd 
gene is involved in determining transpiration efficiency. 

5 

In another embodiment, the method comprises: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 
in a plant; 

(b) identifying multiple alleles of a gene that is linked to said locus, wherein said 
10 gene is a candidate gene involved for determining the transpiration efficiency 

of a plant; and 

(c) determining the transpiration efficiencies of a panel of plants, wherein each 
member of said panel comprises at least one of said multiple alleles, wherein 
variation in treuispiration efficiency between the members of said peuael 

15 indicates that said gene is involved in determining transpiration efficiency. 

Preferably, the identified gene or allele identified by the method described in the 
preceding paragraph is an ERECTA allele, an erecta allele, a homolog of ERECTA 
allele, or a homolog of erecta allele, wherein said homolog is from a plant species 
20 other than A thaliana. 

The identffied gene or allele, including any homologs from a plant other than A 
thaliana, such as, for example, the wild-type EREGTA allele or a homolog thereof, is 
useful for the production of novel plants. Such plants are produced, for example, 
25 using recombinant techniques, or traditional plant breeding approaches such ets 
introgression. 

Accordingly, a still further aspect of the present invention provides a method of 
enhancing the transpiration efficiency of a plant comprising ectopically expressing 
30 in a plant an isolated gene comprising a nucleotide sequence that is homologous to a 
protein-encoding region of a gene of A thaliana that maps to the ERECTA locus on 
chromosome 2. A related embodiment of the invention provides a method of 
enhancing the transpiration efficiency of a plant comprising introgressing into said 
plant a nucleic acid comprising a nucleotide sequence that is homologous to a 
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protein-encoding region of a gene of A thaliana that maps to the ERECTA locus on 
chromosome 2. 

A further aspect of the invention provides for the use of an isolated gene comprising 
5 a nucleotide sequence that is homologous to a protein-encoding region of a gene of 
A thaliana that maps to the ERECTA locus on chromosome 2 in the preparation of a 
gene construct for enhancing the tremspiration efficiency of a plant. 

A fifth aspect of the present invention provides a plant having enhanced 
10 transpiration efficiency, wherein said plant is produced by a method described 
herein. 

DETAILED DESCRIPTION OF THE INVENTION 
Loci for transpiration efficiency and their identification 
15 One aspect of the invention provides a locus associated with the genetic variation in 
transpiration efficiency of a plant, wherein said locus comprises a nucleotide 
sequence linked genetically to the ERECTA locus on chromosome 2 or a homolog 
thereof. 

20 As used herein, the term "locus" shall be taken to mean the location of one or more 
genes in the genome of a plant that affects a quantitative characteristic of the plant, 
in particular the treuispiration efficiency of a plant. In the present context, a 
"quantitative characteristic" is a phenotype of the pleuit for which the phenotypic 
variation among different genotypes is continuous and cannot be separated into 

25 discrete classes, irrespective of the niunber of ^enes that determine or control the 
phenotype, or the magnitude of genetic effects that single, gene has in determining 
the phenotype, or the magnitude of genetic effects of interacting genes. 

By "associated with the genetic variation in transpiration efficiency of a plant" 
30 means that a locus comprises one or more genes that are expressed to determine or 
regulate the transpiration efficiency of a plant, irrespective of the actual rate of 
transpiration achieved by the pleuit under a specified environmental condition. 
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The present invention clearly contemplates the presence of multiple genes that are 
genetically linked or map to the specified ERECTA locus on chromosome 2. Without 
being bound by any theory or mode of action, such multiple linked genes may 
interact, such as, for excunple, by epistatic interaction, to determine the transpiration 
5 efficiency phenotj^e. 

The present invention also contemplates the presence of different alleles of any gene 
that is linked to the ERECTA locus, wherein said allele is expressed to determine the 
transpiration efficiency phenotype. In one embodiment, such alleles are identified 

10 by detecting a particular transpiration efficiency phenotype that is linked to the 
expression of the particular allele. Alternatively, or in addition, the different alleles 
linked to a locus are identified by detecting a structural polymorphism in DNA (eg. a 
restriction fragment length poljonorphism (RFLP), amplified fragment length 
polymorphism (AFLP), single strand chain poljonorphism (SSCP), and the Uke), that 

15 is linked to a particuleu: treinspiration efficiency phenotype. 

The present invention clearly encompasses all interacting genes and/or alleles that 
are genetically linked to an ERECTA locus and are expressed to determine a 
transpiration efficiency phenotype. Such linked interacting genes and/or alleles will 

20 map to a region of the genome of a plant that is homologous to a region of the 
Arabidopsis thaliana genome that is associated with the transpiration efficiency of 
that plant and maps to the ERECTA locus. Preferably, such interacting genes and/or 
alleles comprise a protein-encoding portion of a gene of A thalicaia positioned 
within the ERECTA locus of the Arabidopsis thaliana genome that is associated with 

25 the transpiration efficiency of that plant, or a homologous protein-encoding region 
from another plant species. 

The terms ''region of the Arabidopsis thaliana genome that is associated with the 
transpiration efficiency, and 'locus of A thaliana that determines the transpiration 
30 efficienc}r'' shall bp taken to mean that portion of chromosome 2 of A thaliana 
associated with transpiration efficiency, preferably the region from about 46cM to 
about 50.7 cM. 
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Even more preferably, the locus of the invention is linked to or comprises the 
EliECTA allele or the erecta alleles, or a protein-encoding portion thereof or a 
homologous gene from another plemt species. 

5 As used herein, the term "ERECTA" shall be taken to refer to a wild type A thaliana 
ERECTA allele or a homolog thereof from another plant species. 

For the purposes of nomenclature, the nucleotide sequence of the Arabidopsis 
thaliana ERECTA protein-encoding region, including 5'-imtranslated region (UTR) 
10 and 3'-UTR, is provided herein as SEQ ID NO: 1. The amino acid sequence of the 
polypeptide encoded by SEQ ID NO: 1 is set forth herein as SEQ ID NO: 2, 

The term "erecta" shall be taken to mean any alleUc variant of the wild-type ERECTA 
allele that modifies treaispiration efficiency of A thaliana or a homolog thereof from 

15 another plant species. Preferred erecta alleles contemplated herein include an A 
thaliana erecta allele selected from the group consisting of: er,.erls er2 allele, etlOl 
allele, eria2allele, erl03 allele, erl04 allele, erl05 allele, er2 allele, erl08 allele, 
erl09 allele, erllO allele, erlll allele, erllZ allele, erll3 allele, erll4 allele. erll5 
allele, erllG allele, erll7 allele, erll8 allele, erllQ allele, erl20 allele, erl21 allele, 

20 erl22 allele, erl23 allele (Lease et al. New Phytologist 151, 133-143, 2001) and a 
homolog of any one of said alleles. 

Those skilled in the art are aware that the terms "homolog" and "ortholog" refer to 
functional equivalent units. In the present context, a homolog or ortholog of a gene 
25 that maps to the ERECTA locus shall be taken to meem any gene from a plant species 
other than A thaliana that is functionally equivalent to a gene that maps to the 
exempUfied A thaliana ERECTA locus, and comprises a protein-encoding region in 
its native plant genome that shares a degree of structural identity or similarity with a 
protein-encoding region that is linked to the A thaliana ERECTA locus. 

30 

Preferably, a homologous or orthologous gene from a plant other than A thaliana 
will be associated with the transpiration efficiency of said plant and be linked to a 
protein-encoding region in its native plant genome that comprises a nucleotide 
sequence having at least about 55% overall sequence identity to a protein-encoding 



17 

region linked to the ERECTA locus. Even more preferably, the percentage identity 
will be at least about 59-61% or 70% or 80%, still more preferably at least about 
90%, and even still more preferably at least about 95%. 



5 In determining whether or not two nucleotide sequences fall within a particular 
percentage identity limitation recited herein, those skilled in the art will be aware 
that it is necessary to conduct a side-by-side comparison or multiple aHgnment of 
sequences. In such comparisons or aHgnments, differences may arise in the 
positioning of non-identical residues, depending upon the algorithm used to perform 

10 the alignment. In the present context, reference to a percentage identity between 
two or more nucleotide sequences shall be taken to refer to the number of identical 
residues between said sequences as determined using any standard algorithm known 
to those skilled in the art. For example, nucleotide sequences may be aligned and 
their identity calculated using the BESTFIT program or other appropriate program of 

15 the Computer Genetics Group, Inc., University Research Park, Madison, Wisconsin, 
United States of America (Devereaux et al, NucL Acids Res. 12, 387-395, 1984), 

Alternatively, or in addition, a homologous or orthologous gene from a plant other 
than A thaliana will be associated with the transpiration efficiency of said plant and 

20 be linked to a protein-encoding region in its native plant genome that comprises a 
nucleotide sequence that encodes a polypeptide having at least about 55% overall 
sequence identity to a polypeptide encoded by a protein-encoding region linked to 
the ERECTA locus. Preferably, the percentage identity at the amino acid level will be 
at least about 59-61% or 70% or 80%, more preferably at least about 90%, and stiU 

25 more preferably at least about 95%. 

In determining whether or not two amino acid sequences faU within these 
percentage hmits, those skilled in the art wiU be aware that it is necessary to conduct 
a side-by-side comparison or multiple aHgnment of sequences. In such comparisons 
or aUgnments. differences wiU arise in the positioning of non-identical residues, 
depending upon the algorithm used to perform the alignment In the present 
context, reference to a percentage identity or similarity between two or more amino 
acid sequences shall be taken to refer to the number of identical and similar residues 
respectively, between said sequences as determined using any standard algorithm 



30 
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known to those skilled in the art. For example, amino acid sequence identities or 
similarities may be calculated using the GAP program and/or aligned using the 
PILEUP program of the Computer Genetics Group, Inc., University Research Park, 
Madison, Wisconsin, United States of America (Devereaux et al, 1984, supra). The 
5 GAP progreun utiUzes the algorithm of Needleman and Wunsch, /. MoL BioL 48, 443- 
453, 1970, to maximize the number of identical/similar residues and to minimize the 
niunber and length of sequence gaps in the alignment. Alternatively or in addition, 
wherein more than two amino acid sequences are being compared, the CliistalW 
program of Thompson et al., Nucl. Acids Res. 22, 4673-4680, 1994, is used. 

10 

Alternatively, or in addition, a homologous or orthologous gene from a plant other 
than A thaliana will be associated with the transpiration efficiency of said plant and 
be linked to a protein-encoding region in its native plant genome that hybridizes to 
nucleic acid that comprises a sequence complementary to a protein-encoding region 
15 linked to the A thaliana ERECTA locus. Preferably, such homologs or orthologs will 
be identified by hybridization under at least low stringency conditions, and more 
preferably under at least moderate stringency or high stringency hybridization 
conditions. 

20 For the purposes of defining the level of stringency, a low stringency is defined 
herein as being a hybridization or a wash carried out in 6xSSG buffer, 0.1% (w/v) 
SDS at 28*^C or alternatively, as exempHfied herein. Generally, the stringency is 
increased by reducing the concentration of salt in the hybridization or wash buffer, 
such as, for example, by reducing the concentration of SSC. Altematively, or in 

25 addition, the stringency is increased, by increeising the concentration of detergent 
(eg. SDS). Alternatively, or in addition, the stringency is increased, by increasing the 
temperature of the hybridization or weish. For example, a moderate stringency can 
be performed using 0,2xSSC to 2xSSC buffer, 0.1% (w/v) SDS, at a temperature of 
about 42*^0 to about 65°C, Similarly, a hi^ stringency can.be performed using 

30 O.lxSSC to 0.2xSSC buffer, 0,1% (w/v) SDS, at a temperature of at least 55°C. 
Conditions for performing nucleic acid hybridization reactions, and subsequent 
membrane washing, are well understood by one normally skilled in the art For the 
piuposes of ftirther clarification only, reference to the parameters affecting 
hybridization between nucleic acid molecules is foimd in Ausubel ef a/.. In: Current 
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Protocols in Molecular Biology, Greene/Wiley, New York USA, 1992, which is herein 
incorporated by reference. 

Preferably, the homologous gene is derived from, or present in, the genome of a plant 
5 that is desiccation or drought intolerant, or poorly adapted for growth in dry or arid 
environments, or that suffers from reduced vigor or growth during periods of 
reduced rainfall or drought, or from the genome of a plant with increased growth 
rate or growth duration or partitioning of C to shoot and harvested parts imder well- 
watered conditions. 

10 

More preferably, the homologous gene is derived from, or present in, the genome of a 
brassica plant, broad acre crop plant, perennial grass (eg. of the subfamily Pooidaea, 
or the Tribe Poeae), or tree. Even more preferably, the homologous locus is in the 
genome of a plant selected from the group consisting of barley, wheat, rye, sorghum, 
15 rice, maize, Phalaris aquatica, Dactylus glomerata, Loliiim perenne, Festuca 
arundinacea, cotton, tomato, soybeem, oilseed rape, poplar, and pine. 

A particularly preferred homolog of a gene that maps to the exemplified Arabidopsis 
thaliana ERECTA locus is derived from chromosome 6 of rice [Oiyza sativa], and, 
20 more preferably is linked to an ERECTA gene derived from rice. For the purposes of 
nomenclature, the protein-encoding region of the rice ERECTA gene is provided 
herein as SEQ ID NO: 3, The amino acid sequence of the polypeptide encoded by 
SEQ ID NO: 3 is set forth herein as SEQ ID NO: 4, . 

25 Another particularly preferred homolqg of a gene that maps to the exemplified 
Arabidopsis thaliana ERECTA locus is derived from the genome of Sorghum bicolor, 
and, more preferably is Unked to an ERECTA gene derived from sorghinn. For the 
purposes of nomenclature, the . protein-encoding region of the sorghum ERECTA 
gene is provided herein as SEQ ID NO: 5. The amino acid sequence of the 
, 30 polypeptide encoded by SEQ ID NO: 5 is set forth herein as SEQ ID NO: 6, 

Another particularly preferred homolog of a gene that maps to the exemplified 
Arabidopsis thaliana ERECTA locus is derived from the genome of A thaliana, and, 
more preferably is linked to an ERECTA gene derived from A thaliana. For the 
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purposes of nomenclature, the protein-encoding region of the A ihaliana ERECTA 
homologue is provided herein eis SEQ ID NO: 7. The amino acid sequence of the 
polypeptide encoded by SEQ ID NO: 7 is set forth herein as SEQ ID NO: 8. 

5 Another particularly preferred homolog of an ERECTA gene is provided herein as 
SEQ ID NO: 9. The amino acid sequence of the polypeptide encoded by SEQ ID NO: 
9 is set forth herein as SEQ ID NO: 10. 

A nvunber of mapping methods for determining useful loci and estimating their 

10 effects have been described (eg, Edwards et al.. Genetics 116, 113-125, 1987; Haley 
and Knott, Heredity 69, 315-324, 1992; Jiang and Zeng, Genetics 140, 1111-1127, 
1995; Lander and Botstein, Genetics 121, 185-199, 1989; Jansen and Stam, Genetics 
136, 1447-1455, 1994; Utz and Melchinger, In: Biometrics in Plant Breeding: 
Applications of Molecular Markers. Proc. Ninth Meeting of the EUCARPIA Section 

15 Biometrics in Plant Breeding, 6-8 July 1994, Wageningen, The Netherlands, (J.W. 
van Ooijen and J, Jansen, eds), ppl95-204, 1994; Zeng, Genetics 136, 1457-1468, 
1994). In the present context, these methods are applied to identify the major 
component(s)of the total genetic variance that contribute(s) to the variation in 
transpiration efticiency of a plant, such as, for example, determined by the 

20 measurement of carbon isotope discrimination (A). More particularly, the 
segregation of known markers is used to map and/or characterize an underlying 
locus associated with transpiration efficiency. The locus metiiod involves searching 
for associations between the segregating molecular markers and transpiration 
efficiency in a segregating population of plants, to identify the linkage of the marker 

25 to the locus. 

To discover a markerAocus linkage, a segregating poptdation is required. 
Experimental populations, such as, for example, an F2 generation, a backcross (BC) 
population, recombinant inbred line (RIL), or double haploid line (DHL), can be used 
30 as a mapping population. Bulk segregant analysis, for the rapid detection of markers 
at specific genomic regions using segregating populations, is described by 
Michelmoore et al., Proc. Natl Acad Scf. (USA) 88, 9828-9832, 1991. In the case of 
F2 mapping populations, F2 plants are used to determine genotype, and F2 families 
to determine phenotype. Recombinant inbred lines are produced by single-seed 
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descent. Recombinant inbred lines, such as, for example, the F9 RILs of A thaliana 
(eg. Lister and Dean, Plant 4, 745-750, 1993) will be known to those skilled in the 
art. Near isogenic lines (NILs) are used for fine mapping, and to determine the effect 
of a particular locus on treinspiration efficiency. An advantage of recombinant inbred 
5 lines and double haploid lines is that they are permanent populations, and as a 
consequence, provide for repUcation of the contribution of a particular locus to the 
transpiration efficiency phenotype. 

As for statistical methods, Single Marker Analysis (Point Analysis) is used to detect a 
10 locus in the vidnily of a single genetic marker. The mean transpiration efficiencies 
of a population of plants segregating for a particular marker, are compared according 
to the marker class. The difference between two mean transpiration efficiencies 
provides an estimate of the phenotypic effect of substituting one allele for another 
allele at the locus. To determine whether or not the inferred phenotypic effect is 
15 significeuitly different from zero, a simple statistical test, such as t-test or F-test, is 
used. A significant value indicates that a locus is located in the vicinity of the 
mcurker. Single point analysis does not require a complete molecular linkage map. 
The further locu^ is firom the marker, the less likely it is to be detected statistically, 
as a consequence of recombination between the marker and the gene. 

20 

In the Anova, t-test or GLM approach, the association between marker genotj^e and 
transpiration efficiency phenotype comprises: 

(i) classifying progeny of a segregating population of plants by marker genotype, 
such as for example, using RFLP, AFLP, SSCP, or loicrosatellite analyses, 

25 thereby establishing classes of plants; 

(ii) comparing the mean transpiration efficiencies of classes of plants in the 
segregating population, using a t-test, GLM or ANOVA; and 

(iii) determining the significance of the differences in the mean at (ii), wherein a 
significeait difference indicates that the marker is linked to the locus for 

30 transpiration efficiency. 



As will be known to those skilled in the art, the difference between the means of the 
classes provides an estimate of the effect of the locus in determining the 
transpiration efficiency of a class. 
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In the regression approach, the association between marker genotype and phenotype 

is determined by a process comprising: 

(i) assigning nmneric codes to marker genotypes; caid 
5 (ii) determining the regression value r for transpiration efficiency against the 
codes, wherein a signijBcant value for r indicates that the marker is linked to 
the locus for transpiration efficiency, and wherein the regression slope 
estimate of the effect of a particular locus on transpiration efficiency. 

10 For QTL interval mapping, the Mapmaker algorithm developed by Lincoln et aJ., 
Constructing genetic linkage maps with MAPMAKER/EXP version 3.0: A tutorial and 
reference manual. Whitehead Institute for Biomedical Research, Cambridge, MA, 
USA, 1993 , can be used. Th.e principle behind interval mapping is to test a model for 
the presence of a QTLat many positions between two mapped marker loci. This 

15 model is a fit of a presumptive QTL to transpiration efficiency, wherein the 
suitability of the fit is tested by determining the maximtmi likelihood that a QTL for 
transpiration efficiency lies between two segregating markers. For example, in the 
case of a QTL located between two segregating markers, the 2-loci marker genotypes 
of segregating progeny will each contain mixtures of QTL genotypes. Accordingly, it 

20 is possible to search for loci parameters that best approximate the distribution in 
transpiration efficiency for each marker class. Models are evaluated by computing 
the likelihood of the observed distributions with and without fitting a QTL effect. 
The map position of a QTL is determined as the maximimi likelihood firom the 
distribution of Hkelihood values (LOD scores: ratio of likelihood that the effect 

25 occurs by linkage: likelihood that the effect occurs by chance), calculated for .each 
locus. 



Interval mapping by regression (Haley and BCnott, Heredity 69, 315-324, 1992) is a 
simplification of the maximum likelihood method supra wherein basic QTL analysis 
or regression on coded marker genotypes is performed, except that phenotypes. are 
regressed on the probabiHty of a QTL genotype as determined from the linkage 
between transpiration efficiency and tiie nearest flanking markers. In most cases, 
regression mapping gives estimates of QTL position and effect that are ahnost 
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identical to those gLven by the maximulu likelihood method. The approximation 
deviates only at places where there are large gaps, or many missing genotypes. 

In the composite interval mapping (CIM) method Qansen and Stam, Genetics 136, 
5 1447-1455, 1994; Utz and Melchinger, 1994, supra; Zeng, Genetics 136, 1457-1468, 
1994), the analysis is performed in the usual way, except that the variemce from 
other QTLs are accounted for by including partial regression more power and 
precision than simple interval mapping, because the effects of other QTls are not 
present as residual variance. CIM can remove the bias that can be caused by the 
10 QTLs that are linked to the position being tested. 



PubUcly available software axe used to map a locus for transpu^ation efficiency. Such 
software include, for example, the following: 

(i) MapMaker/QTL f ftp ://genome. wi.mit.edu/pub/mapmaker3/l . for analyzing F2 
15 or backcross data using standard interval mapping; 

(ii) MQTL, for composite interval mapping in multiple environments or for 
performing simple interval mapping using homozygous progeny (eg. double 
haploids, or recombinant inbred hues); 

(iii) PLABQTL (Utz and Melchinger, PLABlocus Version 1.0. A computer program 
20 to map QTL, Institut fiir Pflanzenziichtung, Saatgutforschung und 

Populationsgenetik, Universitat Hohenheim, 70593 Stuttgart, Germany, 1995; 
http://vvww.um-hoheiiheim.de/'--ipspwww/soft.html) for composite interval 
mapping and simple interval mapping of a locus in mapping populations 
derived from a bi-parental cross by selfing, or in double haploids; 

25 (iv) QTL Cartographer (ht^://statgen.mcsu.edu/qtlcart/cartographer.html) for 
single-marker regression, interval mapping, or composite interval mapping, 
using F2 or backcross popxilations; 
(v) MapQTL (http;//www.cpro.dlo.nl/cbw/); Qgene for performing either single- 
marker regression or interval regression to map loci; €m.d 

30 (vi) SAS for detecting a locus by identifying associations between marker 
genotype and transpiration efficiency by a single marker analysis approach 
such as ANOVA, t-test, GLM or REG. 
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In a particularly preferred embodiment, QTL cartographer or MQTL is used to 
identify a locus associated with the transpiration efficiency of plants. 

Those skilled in the art will also be aware that it is possible to detect multiple 
5 interacting alleles or genes for a partic\ileLr trait, such as, for example, using 
composite interval mapping approaches. To achieve this end, the composite interval 
mapping may be repeated to look for additional loci. Alternatively, or in addition, 
two or more distinct regions of the genome can be nominated as cemdidate loci, and 
a gamete relationship matrix constructed for each cemdidate locus, and a 2-locus 

10 regression performed for each pair of loci, determining a best fit for the interacting 
effects between the two loci or aleles at those loci, including eoiy dominance or 
additive effects. The algorithm described by Carlborg et al., Genetics (2000) can be 
used for simultaneous mapping. In the present context, such an analysis is 
performed with reference to the segregation of transpiration efficiency phenotypes in 

15 the segregating population. 

Use of the ERECTA locus to enhance transpiration efficiency of plants 
As will be known to those skilled in the art, a single locus, if present in the genome 
of a pleint, can have a significant influence on the phenotype of the plant. For 
20 example, Grandillo et al,, Theor AppL Genet 99, 978-987, 1999, showed that for 
tomato a selection made from a total 28 loci determining fruit size and weight 
-explained 20% of the total phenotypic variance in this trait. 

Accordingly, a second aspect of the invention provides a method of selecting a plant 
25 having enhanced transpiration efficiency, comprising: 

(a) identifying a locus associated with genetic variation in transpiration efficiency in 
a plant; and 

(b) selecting a plant that comprises or expresses a gene that maps to the locus. 

30 By "enhanced transpiration efficiency is meant that the plant loses less water per 
imit of dry matter produced, or alternatively, produces an enhanced amoxmt of dry 
matter per unit of water transpired, relative to a counterpart plant. By "counterpart 
plant" is meant a plant having a similar or near-identical genetic background, such 
as, for exemiple, a near4sogenic plant, a sibUng, or parent. 
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In accordance with this aspect of the invention, a locus is identified by conventional 
locus mapping means, and/or by homology searching for genes that map to the 
ERECTA locus on chromosome 2 of the A thaliana genome, such as, for example, by 
5 searching for homologs of the A thaliana ERECTA allele or erecta allele as described 
herein above. 

Preferably, to select a plant that comprises or expresses the appropriate gene, 
marker-assisted selection (MAS) is used. As will be known to those skilled in the 
10 art, once a particular locus has been identified, genetic or physical meirkers that are 
linked to the locus can be readily identified and used to confirm the presence of the 
locus in breeding populations. For a locus that is flanked by two tightly-linked 
markers that recombine only at a low frequency, the presence of the flanking 
markers is indicative of the presence of the locus. 

15 

For marker-assisted selection, it is preferred that the marker is a genetic marker (eg. a 
gene or allele), or a physical marker (eg. leaf hairiness or pod shape), or a molecular 
mmker such as, for example, a restriction fragment length polymorphism (RFLP), a 
restriction (RAPD), ampUfied fragment lengfli polymorphism (AFLP), or a short 

20 sequence repeat (SSR) such as a microsatelUte, or SNP. It is also within the scope of 
the invention to utilize any hybridization probe or amplification primer comprising 
at least about 10 nucleotides in length derived from a chromosome region that is 
linked in the genome of a plant to the ERECTA locus, as a marker to select plants. 
Those skilled in the art will readfly be able to determine such prob.es or primers 

25 based upon the disclosure herein, particularly for those plant genomes which may 
have suflScient chromosome sequence in the region of interest in the genome (eg. A 
thaliana, rice, cotton, barley, wheat, sorghum, maize, tomato, etc). 

For flanking markers that are not tightiy linked, such that there is a large 
30 recombination distance there between, the presence of the appropriate gene is 
assessed by identifying those plants having both flanking markers and then selecting 
fiom those plants a plant having an enhanced transpiration efficiency. Natvually, 
the greater the distance between two markers, the larger the population of plants 
required to identify a plant having both markers, the intervening locus and a gene 
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within said locus. Those skilled in the art will readily be able to determine the 
population size required to identify a plant having a particular transpiration 
efficiency, bsised upon the recombination imits (cM) between two markers. 

5 Transpiration efficiency is determined by any means known to the skilled artisan. 
Preferably, transpiration efficiency is determined by measuring dry matter 
accumulation in the plant by gravimetric means, or by measuring water loss, or the 
ratio of CO2 assimilation rate to stomatal conductance. 

10 In a particularly preferred embodimenti the tremspiration efficiency is determined 
directly, by measuring the ratio of carbon fixed carbon assimilation rate) to water 
loss (transpiration rate). 

In an alternative embodiment, treaispiration efficiency is determined indirecfly from 

15 the carbon isotope discrimination value (A). Farquhar et a/., Aust /. Plant Physiol 
9,121-137, 1982, showed that carbon isotope discrimination (A; a measure of the 
extent to which the ^^C/^^C ratio of organic matter is less them that of CO2 in the 
source air), is an effective indirect measine of transpiration efficiency. The 
determination of transpiration efficiency in this manner is based upon the constancy 

20 of the atmospheric "C: ratio (about 98.19: 1.11) and the finding that, in Cg plants 
at least, ribulose bisphosphate carboxylase (Rubisco) enzymes discriminate against 
the use of "C. Thus, "GO2 is less efficientiy assimilated than "CO2, and diffuses 
lessthrough stomata in and out of the leaf. However, when the stomata become 
nearly closed, the diffusion of ^^C02 is more difficult to achieve and, at higher 

25 intracellular concentrations of ^^COz* this isotope is incorporated into 3- 
phosphoglycerate, and subsequentiy into dry matter. As a consequence, carbon 
isotope discrimination (A) is greatest when the overall CO2 assimilation rate during 
photosynthesis (A) is small, and stomatal conductance (g^^) to water vapor is large. 
This relationship is represented by the following algorithm: 

30 A(°/oo)=27-36A/(G„xCJ 

wherein G^ is the ambient GO2 concentration (ie. ["G02+"G02]). 



For a G3 plant that exhibits a value in the range of about 4.5 °/oo to about 6.7 °/oo for 
the term 36A/(Gw x GJ, a 1 °/oo change in carbon isotope discrimination (A) 
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corresponds to a change in transpiration efficiency in the range of about 22% to 
about 15%, respectively. 

The negative relationship between carbon isotope discrimination (A) and 
5 transpiration efficiency has been established for many plant species, including 
wheat (Farquhar and Richards, Aust J, Plant Physiol 11, 539-552, 1984; Farquhar et 
al,, Ann. Rev. Plant Physiol. 40,388-397, 1989), Stylosonthes (Thumma et al., Proc. 9'^ 
Aust Agronomy Conf., Wagga Wagga New South Wales, Australia, 1998), cotton, 
barley, and rice. Accordingly, a lower carbon isotope discrimination (A) value for a 
10 test plant relative to a coimterpart plant is indicative of enhanced transpiration 
efficiency. 

Altematively, or in addition, transpiration efficiency is determined by another 
indicator, such as, for example, leaf temperature, ash content, mineral content, or 

15 specffic leaf weight (dry matter per unit leaf area). For example, specffic leaf weight 
. is positively correlated with transpiration efficiency in peanuts and other species 
(Virgona et al., Aust /. Plant Physiol, 17. 207-214, 1990; Wright et al., Crop Sci 34, 
92-97, 1994). Accordingly, a higher specific leaf weight or higher carbon gain rate 
for a test plant relative to a counterpart plant is indicative of enhanced transpiration 

20 efficiency of the test plant. 

The presence of the locus can be estabUshed by hybridizing a probe or primer that is 
linked to an ERECTA locus, such as, for example, a probe or primer that hybridizes 
to the identified chromosome 2 region of A thaliana or the identified chromosome 6 
25 region of rice. 

Preferably, the presence of the locus is estabUshed by hybridizing a probe or primer 
derived from any one or more of SEQ ID Nos: 1, 3, or 5, or from a homologous gene 
in another plant, or a complementary sequence to such a sequence, to genomic DNA 
30 firom the plant, and detecting the hybridization using a detection means. 

In one embodiment, detection of the hybridization is performed preferably by 
labelling a probe with a reporter molecule capable of producing an identifiable 
signal, prior to hybridization, and then detecting the signal after hybridization. 
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Preferred reporter molecules include radioactively-labelled nucleotide triphosphates 
and biotinylated molecules. Preferably, variants of the genes, exemplified herein, 
including genomic equivalents, are isolated by hybridisation under moderate 
stringency or more preferably, imder high stringency conditions, to the probe. 

5 

Alternatively, or in addition, hybridization may be detected using any format of the 
polymerase chain reaction (PGR), including AFLP. For PGR, two non-complementary 
nucleic acid primer molecules comprising at least about 20 nucleotides in length, 
and more preferably at least 30 nucleotides in length are hybridized to different 
10 strands of a nucleic acid template molecule, and specific nucleic acid molecule 
copies of the template are amplified enzymaticaUy. Several formats of PGR are 
described in McPherson et al,, In: PCR A Practical Approach., JKL Press, Oxford 
University Press, Oxford, United Kingdom, 1991, which is incorporated herein by 
reference. 

15 

For enhancing the transpiration efficiency of a plant wherein the locus is 
polymorphic, such as, for example, an allele, the method supra is modified to 
include the detection of the specific allele(s) Unked to the desired enhancement. 
According to this embodiment, there is provided a method of selecting a plant 
20 having enhanced transpiration efficiency, comprising: 

(a) identifying a locus eissociated with genetic variation in transpiration 
efficiency in a plant; 

(b) identifying a polymorphic marker within said locus that is linked to 
enhanced transpiration efficiency; and 

25 (c) selecting a plant that comprises or expresses the marker. 

Stcmdeird meems known to the skilled artisan are used to identify a marker within the 
locus that is linked to enhanced transpiration efficiency. A population of plants that 
is segregating for the polymorphic marker is generally used, wherein the 
30 transpiration efficiency phenot3^e of plants is then correlated or associated with the 
presence of a particular allelic form of the marker. As exempUfied herein, near- 
isogenic or recombinant inbred lines of plants were screened to segregate alleles at 
the ERECTA locus and to correlate enhanced transpiration efficiency with the 
presence of the ERECTA allele as opposed to an erecta allele. 
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Suitable markers include any one or more of the markers described herein to be 
suitable for MAS. 

5 Preferably, the selection of plants in accordance with these embodiment includes the 
additional step of introducing the locus or polymorphic marker to a plant such as, 
for example, by standard breeding approaches or by recombinant means. This may 
be carried out at the same time, or before, selecting the locus or polymorphic marker. 

10 Recombinant meems generally include introducing a gene construct comprising the 
locus or marker into a plant cell, selecting transformed tissue and regenerating a 
whole plant fix)m the transformed tissue explant. Means for introducing recombinant 
DNA into plant tissue or cells include, but are not limited to, transformation using 
CaClz and variations thereof, in particular the method described by Hanahan (1983), 

15 direct DNA uptake into protoplasts (Krens ef cd. Nature 296, 72-74, 1982; Paszkowski 
et al., EMBOJ. 3, 2717-2722, 1984), PEG-mediated uptake to protoplasts (Armstrong 
et al., Plant Cell Rep. 9, 335-339, 1990) nucroparticle bombardment, electroporaidon 
(Fromm et al.. Prod. Natl Acad. Sci, (USA), 82, 5824-5828, 1985), microinjection of 
DNA (Crossway et al., MoL Gen. Genet 202, 179-185, 1986), microparticle 

20 bombardment of tissue explants or cells (Christou et al. Plant Physiol. 87, 671-674, 
1988; Sanford, Part ScLTechnol. 5, 27-37, 1988). vacuum-infQtration of tissue with 
nucleic acid, or in the case of plants, T-DNA-mediated transfer from Agrobacterium 
to the plant tissue as described essentially by An al,, EMBO /. 4, 277-284, 1985; 
Herrera-Estrella et al., Herrera-Estella et al.. Nature 303, 209-213, 1983; Herrera- 

25 Estella el al., EMBOJ. 2, 987-995, 1983; or Herrera-Estella et al.. In: Plant Genetic 
Engineering, Cambridge University Press, N.Y., pp 63-93, 1985.. 

For microparticle bombardment of cells, a micfoparticle is propelled into a cell to 
produce a transformed cell. Any suitable ballistic cell transformation methodology 
30 and apparatus can be used in performing the present invention. Exemplary 
appaiatus and procedinres are disclosed by Stomp et al. (U.S. Patent No. 5,122,466) 
and Sanford and Wolf (U.S. Patent No. 4,945,050). When using ballistic 
transformation procedures, the gene construct may incorporate a plasmid capable of 
repUcating in the cell to be transformed. 
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Examples of microparticles suitable for use in such systems include 1 to 5 micron 
gold spheres. The DNA construct may be deposited on the microparticle by any 
suitable technique, such as by precipitation. 

5 

A whole plant may be regenerated fix>m the transformed or transfected cell, in 
accordance with procedures weU known in the art Plant tissue capable of 
subsequent clonal propagation, whether by organogenesis or embryogenesis, may be 
transformed with a gene construct of the present invention and a whole plant 

10 regenerated therefrom. The particular tissue chosen will vary depending on the 
clonal propagation systems available for. and best suited to, the particular species 
being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, 
cotyledons, hypocotjds. megagametophytes. callus tissue, existing meristematic 
tissue (eg., apical meristem, axiUaiy buds, and root meristems), and induced 

15 meristem tissue (eg., cotyledon meristem and hypocotyl meristem). 

The term "organogenesis", as used herein, means a process by which shoots and 
roots are developed sequentially from meristematic centres. 

20 The term "embryogenesis". as used herein, means a process by which shoots and 
roots develop together in a concerted fashion (not sequentially), whether from 
somatic cells or gametes. 



The generated transformed plants may be propagated by a variety of means, such as 
25 by clonal propagation or classical breeding techniques. For example, a first 
generation (or Tl) fransfoimed plant may be selfed to give homozygous second 
generation (or T2) transformant, and the T2 plants further propagated tiirough 
classical breeding techniques. 

30 The generated transformed organisms contemplated herein may take a variety of 
forms. For example, they may be chimeras of ti^sformed cells and non-fransformed 



31 

cells; clonal transformaiits (eg., all cells transformed to contedua the expression 
cassette); grafts of transformed and xmtransformed tissues, (eg., in plants, a 
transformed root stock grafted to an untrcoisformed scion ). 

5 Alternatively, the transformed plants are produced by an in planta transformation 
method using Agrobacterium tumefaciens, such as, for example, the method 
described by Bechtold et cd„ CR Acad ScL (Paris, Sciences de la vie/ life Sciences) 
316, 1194-1199, 1993 or Qough et al„ Plant J 16: 735-74, 1998, wherein A 
tumefaciens is appUed to the outside of the developing flower bud and the binary 
10 vector DNA is then introduced to the developing microspore and/or macrospore 
and/or the developing seed, so as to produce a transformed seed. Those skilled in 
the art will be aware that tiie selection of tissue for use in such a procedure may 
vary, however it is preferable generally to use plant material at the zygote formation 
stage for in planta transformation procediures. 

15 

Identification of genes for determing the transpiration efficiency of a plant 
As exemplified herein^ the inventors also identified specific genes or cQleles that are 
linked to the ERECTA locus and determine the transpiration efficiency of a plant. 
More particularly, the transpiration efficiencies of near-isogenic lines, wherein each 

20 line carries a mutation within a target locus in the region of a locus associated with 
transpiration efficiency, were determined, thereby providing the genetic contribution 
of that locus to transpiration efficiency. This analysis allowed the inventors to assess 
the genetic contribution of particular alleles to transpiration efficiency, thereby 
determining allelic variants that are linked to a particular transpiration efficiency. 

25 Thus, the elucidation of the ERECTA locus for transpiration efficiency in plants 
facilitated the fine mapping and determination of allelic variants that determine 
transpiration efficiency. 

Accordingly, a third aspect of the invention provides a method of identifying a gene 
30 that determines the transpiration efiEiciency of a plant 



In one embodiment, the method comprises: 
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(a) identifying a locus associated with genetic veuciation in transpiration efficiency 
in a plant; 

(b) identifying a gene or allele that is linked to said locus, wherein said gene or 
allele is a candidate gene or eJlele for determining the transpiration efficiency 

5 of a plant; and 

(c) determining the transpiration efficiencies of a panel of near isogenic plants, 
wherein not all members of said panel comprise said gene or allele, and 
wherein variation in transpiration efficiency between the members of said 
panel indicates that said gene is involved in determining transpiration 

10 efficiency. 



In another embodiment, the method comprises: 

(a) identifying a locus associated with genetic variation in transpiration efficiency 
in a plant; 

15 (b) identifying multiple aUeles of a gene that is Unked to said locus, wherein said 
gene is a candidate gene involved for determining the transpiration efficiency 
of a plant; and 

(c) determining the transpiration efficiencies of a pemel of near isogenic plants, 
wherein each member of said panel comprises at least one of said multiple 
20 alleles, wherein variation in transpiration efficiency between the members of 

said panel indicates that said gene is involved in deteriaiiiing transpiration 
efficiency. 

In the present context, the term "near isogenic plants'* shall be taken to mean a 
25 population of plants having identity over a substantial proportion of their genomes, 
notwithstanding the presence of sufficiently few differences to permit the 
contribution of a distinct allele or gene to the transpiration efficiency of a plant to be 
determined by a comparison of the transpiration efficiency phenot3^es of the 
population. As will be known to the skilled artisan, recombinant inbred Unes, lines 
30 produced by introgression of a gene followed by several generations of backcrossing, 
or siblings, are suitable near-isogenic lines for the present purpose. 

Preferably, the identified gene or allele identified by the method described in the 
preceding paragraph is selected from the group consisting of ERECTA gene, Erecta 
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edleles, homologs of ERECTA, , wherein said homologs are from plants species other 
than A thaJiana, 

In a particularly preferred embodiment, the identified gene or edlele will comprise a 
5 nucleotide sequence selected from the group consisting of: 

(a) a sequence having at least about 55% identity to a sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 
7, and SEQ ID NO: 9; 

(b) a sequence encoding an amino acid sequence having at least about 55% 
10 identity to an amino acid sequence selected from the group consisting of SEQ 

ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 and SEQ ID NO: 10; 
and 

(c) a sequence complementary to (a) or (b). 

15 Preferably, the percentage identity is at least about 59-61% or 70% or 80%, more 
preferably at least about 90%, and even more preferably at leas.t about 95% or 99%. 
In a particularly preferred embodiment, the identified gene or allele comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 
20 3, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID NO: 9; 

(b) a sequence encoding an amino acid sequence selected from the group 
consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 and 
SEQIDNO: 10;and 

(c) a sequence complementary to (a) or (b), 

25 

Enliancement of transpiration efficiency using isolated genes 

The identified gene or alleles, including any homologs from a plant other than A, 
thaliana, such as, for example, the wild-type ERECTA allele, or a homolog thereof, is 
useful for the production of novel plants. Such plants are produced, for example, 
30 using recombinant techniques, or traditional plant breeding approaches such as by 
introgression. 

Accordingly, a fourth aspect of the present invention provides a method of 
enhancing the transpiration efficiency of a plant comprising ectopically expressing 
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in a plant an isolated gene comprising a nucleotide sequence that is homologous to a 
protein-encoding region of a gene that is linked to the A ihaliana ERECTA locus on 
chromosome 2. 

5 In a particularly preferred embodiment, the isolated gene comprises a nucleotide 
sequence selected from the group consisting of: 

(a) a sequence having at least about 55% identity to a sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 
7, and SEQ ID NO: 9; 

10 (b) a sequence encoding an amino acid sequence having at least about 55% 
identity to an amino acid sequence selected from the group consisting of SEQ 
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 and SEQ ID NO: 10; 
and 

(c) a sequence complementary to (a) or (b). 

15 

Preferably, the percentage identity is at least about 59-61% or 70% or 80%, more 
preferably at least about 90%, and even more preferably at least about 95% or 99%. 

In a particularly preferred embodiment, the isolated gene or allele comprises a 
20 nucleotide sequence selected from the group consisting of: 

(a) a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 
3, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID NO: 9; 

(b) a sequence encoding an amino acid sequence selected from the group 
consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NQ: 6, SEQ ID NO: 8 and 

25 SEQIDNO: 10;and 

(c) a sequence complementary to (a) or (b). 

To ectopically express the isolated gene in a plant, the protein-encoding portion of 
the gene is generally placed in operable connection with a promoter sequence that is 
30 operable in the plant, which may be the endogenous promoter or edternatively, a 
heterologous promoter, and a transcription termination sequence, which also may be 
the endogenous or an heterologous sequence relative to the gene of interest. The 
promoter and protein-encoding portion and transcription termination sequence are 
generally provided in the form of a gene construct, to facilitate introduction and 
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maintenance of the gene in a plant where it is to be ectopically expressed. Numerous 
vectors suitable for introducing genes into pleuits have been described and are 
readily available. These may be adapted for expressing an isolated gene in a plant to 
enhance transpiration efficiency therein. 

5 

Reference herein to a "promoter** is to be taken in its broadest context and includes 
the transcriptional regulatory sequences of a classical eukaryotic genomic gene, 
including the TATA box which is required for accurate transcription initiation, with 
or without a CCAAT box sequence and additional regulatory elements (ie. upstream 

10 activating sequences, enhancers and silencers) which alter gene expression in 
response to developmental and/or extemal stimuli, or in a tissue-specific manner. In 
the present context, the term "promoter" is also used to describe a S3aithetic or fusion 
molecule, or derivative which confers, activates or enhances expression of said sense 
molecule in a cell. Preferred promoters may contain additional copies of one or 

15 more specific regulatory elements, to further enhance expression and/or to alter the 
spatial expression and/or temporal expression of a nucleic acid molecule to which it 
is operably connected. For example, copper-responsive regulatory elements may be 
placed adjacent to a heterologous promoter sequence driving expression of a nucleic 
acid molecule to confer copper inducible expression thereon. 

20 

Placing a nucleic acid molecule under the regulatory control of a promoter sequence 
means positioning said molecule such that expression is controlled by the promoter 
sequence. A promoter is usually, but not necessarily, positioned upstream or 5' of 
the protein-encoding portion of the gene th^t it regulates. Furthermore, the 

25 regulatory elements comprising a promoter are usually positioned within 2 kb of the 
start site of transcription of the structured protein-encoding nucleotide sequences, or 
a chimeric gene comprising same. In the construction of heterologous 
promoter/structural gene combinations it is generally preferred to position the 
promoter at a distance from the gene transcription start site that is approximately the 

30 same as the distance between that promoter and the gene it controls in its natural 
setting, ie., the gene from which the promoter is derived. As. is known in the art, 
some variation in this distance can be accommodated without loss of promoter 
function. Similarly, the preferred positioning of a regulatory sequence element with 
respect to a heterologous gene to be placed xmder its control is defined by the 
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positioning of the element in its natural setting, ie., the genes from which it is 
derived. Agedn, as is known in the axt, some variation in this distance can also 
occiu:. 

5 Promoters suitable for use in gene constructs of the present invention include those 
promoters derived from the genes of viruses, yeasts, moulds, bacteria, insects, birds, 
mcunmals and plants which are capable of functioning in plant cells, including 
monocotyledonous or dicotyledonous plants, or tissues or organs derived from such 
cells. The promoter may regulate gene expression constitutively, or differentially 
10 with respect to the tissue in which expression occxu?s or, with respect to the 
developmented stage at which expression occurs, or in response to extemal stimuli 
such as physiological stresses, pathogens, or metal ions, amongst others. 

Examples of promoters useful in performing this embodiment include the CaMV 35 S 
15 promoter, NOS promoter, octopine synthase (OCS) promoter, Arabidopsis thaliana 
SSU gene promoter, napin seed-specific promoter, SCSV promoter, SCBV promoter 
and the like. In addition to the specific promoters identified herein, cellular 
promoters for so-called housekeeping genes, including the actin promoters, or 
promoters of histone-encoding genes, axe useful. 

20 

The term "terminator" refers to a DNA sequence at the end of a transcriptional unit 
which signals termination of transcription. Terminators are 3'-non-translated DNA 
sequences containing a polyadenylation signal, that facilitate the addition of a 
polyadenylate sequence to the 3'-end of a primeuy trcmscript. Terminators active in 
25 cells derived from viruses, yeasts, riioulds, bacteria, insects, birds, mammals and 
plants are known and described in the literature. They are isolatable from bacteria, 
fungi, viruses, cmimals and/or plants. 

Examples of terminators particularly suitable for use in the gene constructs of the 
30 present invention include the nopaline synthase (NOS) gene terminator of 
Agrobacterium tumefaciens, the terminator of the Cauliflower moseiic virus (CaMV) 
35S gene, the zein gene terminator from Zea mays, the Rubisco small subunit (SSU) 
gene terminator sequences and subclover stimt virus (SCSV) gene sequence 
terminators, amongst others. 
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Those skilled in the art will be aware of additional promoter sequences and 
terminator sequences that may be suitable for use in performing the invention. Such 
sequences may readily be used without any undue experimentation. 

5 

Preferably, the gene construct further comprises an origin of replication sequence for 
its replication in a specific cell type, for example a bacterial cell, when said gene 
construct is required to be maintained as an episomal genetic element (eg. plasmid 
or cosmid molecule) in said cell. Preferred origins of replication include, but are not 
10 limited to, the/Z-ori and coiEl origins of replication. 

Preferably, the gene construct further comprises a selectable marker gene or genes 
that are functional in a cell into which said gene construct is introduced. 

15 As used herein, the term "selectable marker gene" includes any gene which confers a 
phenotj^e on a cell in which it is expressed to facilitate the identification and/or 
selection of cells which are transfected or transformed with a gene construct of the 
invention or a derivative thereof. 

20 Suitable selectable marker genes contemplated herein include . the ampicillin 
resistance (Amp*), tetracyclin-resistance gene (Tc'), bacterial kanamycin resistance 
gene (Kan^, phosphinothricin resistance gene, neomycin phosphotramsferase gene 
(npfll), hygromycin resistance gene, p-glucuronidase (GUS) gene, chloramphenicol 
acetyltransferase (CAT) gene, and luciferase gene, amongst others. 

25 

In a related embodiment, the invention extends to the use of an isolated gene 
comprising a nucleotide sequence that is homologous to a protein-encoding region of 
a gene of A tiialiana that is positioned between about 46cM to about 50.74cM on 
chromosome 2 in the preparation of a gene construct for enhancing the transpiration 
30 efficiency of a plant. 

In an alternative embodiment of the invention, the trcmspiration efficiency of a plem.t 
is enhanced by classical breeding approaches, comprising introgressing the isolated 
gene into a plant For introgression of a gene, the gene is transferred firom its native 
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genetic background into another genetic background using standard breeding, for 
exeunple, a gene that enhemces transpiration efficiency in a progenitor such as a 
diploid cotton or diploid wheat may be transferred into a commercial tetraploid 
cotton or hexaploid wheat, respectively, by steoidard crossing; followed by several 
5 generations of back-crossing to remove the genetic background of the progenitor. 
Naturally, continued selection of the gene of interest is required, such as, for 
example, faciUtated by the use of markers. 

A fifth aspect of the present invention provides a plant having enhanced 
10 transpiration efficiency, wherein said plant is produced by a method described 
herein. 

The present invention is further described with reference to the following non- 
limiting examples. 

15 

EXAMPLE 1 

12Q13Q discrimination as a marker for screening genetic variation 
in transpiration efficiency. 
20 Experimental conditions and sampling procedures were established to allow the 
control of many factors, other than genetic, that influence transpiration efficiency at 
the level of individual leaves and plants. These factors fall into several categories: (a) 
characteristics of the seedlings micro-environment: temperature, light, humidity, 
boundary layer aroimd the leaves, root growth conditions; (b) developmented and 
25 morphological effects that modify gas exchange and C metabolism and therefore 
carbon isotopic signature (eg age, stage, posture); and (c) seed effects. 

We developed high resolution m€Lss-spectrometer techniques for measuring C isotope 
ratios in whole tissues or carbon compounds such as soluble sugcurs -ie a measure of 
30 integrated transpiration efficiency over the plant's life or over a day, respectively, 
and also for measuring instantaneous transpiration efficiency during gas exchange. 



This means: 
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• 0.1 per Tnil analytical precision in the measurement of the isotopic 
composition of leaf carbon. Discnmination, {-), is approximately the isotope 
ratio of carbon in source CO2 minus that of plant organic carbon. In a 
particular experiment, the source CO2 is conunon to all genot3^es. 

5 • 0.1 per mil biological precision, that is variation between repUcated seedlings, 
grown in soil, either in growth chambers or in glasshouses with CO2, 
humidity and temperature control (corresponding to approximately 1.5% 
variation in treuispiration efEiciency). 

• The abihty to grow and screen large batches of seedlings in glasshouses or 
10 growth chambers (up to 1500], imder standardised leaf and root growth 

conditions, to a rosette size of several cm within 2-3 weeks allowing 
individual measurements, on the same plcuit, of isotope ratios and also of the 
underlying properties (eg in situ meastu^ment of leaf temperature by infra-red 
thermometry as a measure of stomatal conductance; chlorophyll fluorescence; 
15 leaf expansion). 

EXAMPLE 2 

Natural genetic variation in transpiration efficiency inArabidopsis thaliana 
A thaliana ecotj^es were screened for leaf A under glasshouse conditions. There 

20 was a large spread of values (corresponding to approximately 30 % genetic variation 
in transpiration efficiency). However, large enviroxunental effects were noted. A 
few contrasted ecotj^es were selected at the two extremes of the range of A values 
and compared under various conditions of irradiance (150 to 500 ^E m"V^), light 
spectnun (Red/Far-Red ratios) and air humidity (60 to 90%) while roots were always 

25 well watered. The magnitude of gen^etic differences in transpiration efBciency was 
very much influenced by environmental conditions. This was in part due to 
variations among ecot3^es in the dependence of photosynthesis on light and vapotu 
pressure deficit. Genetic differences were maximal under a combination of high light 
and low humidity, in growth chambers. 

30 

The ecotypes Columbia (Col) and Landsberg erecta (Ld-er) have extreme carbon 
isotope discrimination values, with Col always having smaller values than Ld-er, and 
thus a greater transpiration efficiency. 
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EXAMPLES 

Identification of a locus associated with transpiration efficiency in A thcdiana 
Quantitative Trait Loci (QTL) analysis of the Lister and Dean's (1993) Recombinant 
Inbred Lines (later referred to as RILs) was performed to identify and map a locus' 
5 associated with carbon isotope discrimination (AJ. The RILs were from a cross 
between Col~4 and Ler-0, Om: analysis confirmed the importance of genes around 
the ER locus, and a role for genes other than EREGTA in conferring transpiration 
efficiency on A thaliana. 

10 More particularly, 300 RI mapping lines between Col and Ler ecotypes, available at 
the Arabidopsis Stock Centre, were generated from a cross between the Arabidopsis 
ecotypes Colinnbia (Col4) emd Landsberg erecta (Ler-0 carrying erl) (Lister and Dean, 
1993), using Columbia as the male parent. A subset of 100 of these lines, chosen as 
the most densely and reliably mapped were used in the present analysis. 

15 

The seeds were multiplied in a glasshouse in an attempt to minimize confoimding 
seed effects in our comparisons. Large numbers of seeds were obtained for most hnes 
except for a few, including Col4 parent, which had to be re-ordered following low 
seed viabihty of the original sample sent by the Stock Centre. The seeds harvested in 
20 these propagation runs were used throughout all our experiments to date. 

Loci were analysed using two programs, QTL cartographer and MQTL. These 
programs compute statistics of a trait at each marker position, using a range of 
methods [linear regression (LR), stepwise regression (SR), and likelihood approaches 

25 (Single interval mapping (SIM) which treats values at individual markers as 
independent values, and composite interval mapping (CIM) which allows for 
interactions between markers and associated locus)]. By nature each of these 
methods has some biases and embedded assmnptions, hence the importance of 
analysing data with more than one program. Only results that were consistent 

30 between the two programs, and robust to additions or deletions to the set of 
background markers used for composite interval mapping are reported below. 

Initial QTL analysis was done in parallel to seed multiplication on a subset of 40 
lines for which enough seeds were sent. Once all seeds had been mtdtiplied this was 
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repeated on the full set of 100 lines. These two analyses indicated the existence of a 
locus for carbon isotope discrimination (A), that maps to the region including the 
ERECTA locus on chromosome 2, at approximately 46-51 cM (Table 1, nm 1&2), 

5 Given the complexity and integrative nature of A as a physiological trait, such a 
small number of loci associated with the trait was not expected. Subsequent 
experiments were therefore designed to test these results and assess their stability 
across the reuage of environmental conditions known for their effects on gene 
expression related to A (see above). QTL einalysis was repeated on several completely 

10 independent data sets obtained under highly controlled conditions in glasshouses or 
growth chambers, where either air humidity, photoperiod or irradiance (amount, 
diurnal pattern, day to day variation) was varied. Depending on the experiment^ all 
100 recombinants inbred lines were included or ordy the subset of Unes with cross- 
overs on chromosome 2. These experiments confirmed that genetic variation in A 

15 could be mostly ascribed to a portion of chromosome 2 (Table 1) between about 46- 
50.7 cM. 

When RILs were sorted graphicedly according to c8ucbon isotope discrimination and 
their genotype at the ER marker (50.64 cM) and its vicinity (Ld-eri genotype or Col- 

20 ER genotype), lines which were Ld-er at the ERECTA marker ranked mostly at the 
high end of carbon isotope discrimination values. In contrast, lines having a Col- 
ERECTA marker genotype ranked mostly at the low end of carbon isotope 
discrimination values (data available on request). In the middle of the range of 
carbon isotope discrimination values, there W€is some overlap between the two sets 

25 of lines. Some lines were always at an extreme (in all 18 experiments performed), 
while the ranking of other lines was more unstable. These data indicate a locus for 
transpiration efficiency, as determined by the carbon isotope discrimination value, 
in the vicinity of the ERECTA locus on chromosome 2 (Table 1). This locus most 
likely involves the ER gene. Depending on the positions of cross-overs between Ld-er 

30 and Col, recombination between ERECTA and one or more of the other genes 
influences the transpiration efficiency phenotype of the progeny. 

EXAMPLES 

Determination of a role for the ERECTA gene in regulating transpiration efficiency 
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We compared Col and Ler ecolypes with near-isogenic mutant lines for the erecLa 
gene, to examine a possible role of the ERECTA gene in determining carbon isotope 
discrimination (A). 

5 Plants expressing the wild type ERECTA gene (SEQ ID NO: 1), or an erecta mutant 
allele in the Colmnbia backgroimd (eg. Col-erlOl to -erlOS; or Col-erl08 to -erl23), 
have been publicly described. Two of these mutants were available for comparison 
to the isogenic or near-isogenic lines (Table 2). 

10 Col4, the other parental line {ER) for Lister and Dean's RILs was systematically 
included in the comparison. Where possible, other Col "ecotypes" were eJso 
included, (eg. ColO , Col3-7), to assess their similarity with respect to carbon isotope 
disciimination, especially compared to the RIL parental ecotype Col4« 

15 The results of these comparisons are described in Table 3. Data indicate that the 
differences in carbon isotope discrimination values between er and jBjR lines for 15 
different experimental runs corresponding to growth imder low, to high hght (100 to 
800 /xE m*^ s*^), low to high hiunidily (40 to 85%), short to long days (8, 10, 24hrs), 
normal to hi^ temperatures (22/20°C to 28/20 °C). 

20 

As expected, the spread of carbon isotope discrimination values eonong lines varied 
with environmental conditions. Lines carrying er mutations have a greater carbon 
isotope discrimination value overall than those having the ER wild tj^e gene (see 
Table 3, column 1), indicative of a lower water use-efficiency. There is usually litde 

25 difference in C isotopic discrimination between the various Col lines, (see the 
similar averages obtained for colxunns 2, 3, and 4 in Table 3, wherein erl05 is 
compared to 3 different Col ecotypes, ColO, Col4 and 3176 or Coll). When present, 
the erl05 mutant always has the greatest carbon isotope discrimination value of all 
lines, including erl and er2 (columns 2-4 compared to columns 5-6 in Table 3, or 

30 column 8 compared to colvunn 9 in Table 3). The value measured in the erl 05 
mutant is always significantly greater than in the ER isogenic line (colvunn 4 in Table 
3). The value measured in erl (Landsberg parental line NW20) is usually also greater 
than that in the ER lines 3177 (near isogenic, colvunn 6 of Table 3), and to a lesser 
extent Col4 (Columbia parental line, column 7 of Table 3). These observations give 
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direct evidence that the ERECTA gene plays a significant role in determining genetic 
differences in carbon isotopic discrimination in Arabidop^is. 

This conclusion is independently confirmed by leaf gas exchange measurements that 
5 allow the direct measure of transpiration efficiency (ratio of net CO2 fixation to water 
loss; column 4 in Table 4; Figiures la-lc, 2a-2c). Measurements on mature leaves 
reveal that ER lines are characterised by a greater ratio of CO2 assimilation to water 
loss than lines carrying er mutations. This is most obvious when comparing the pair 
Coll/ erl05 with a 21% greater transpiration efficiency (ratio A/E) in Coll than erlOS, 
10 or the pair Coll/er2 with a 16% greater treuispiration efficiency in Coll, Consistent 
with the measiurements of carbon isotope discrimination, the effect er/ER is relatively 
smaller in the Ld backgroimd (9% greater ratio A/E in Ld-ER (3177) than the Ld- 
erl(NSW20) background. 

15 Also consistent with the carbon discrimination measurements, is the 20% difference 
in transpiration efficiency between the two RILs parental lines (4.06 and 3.38 
mmolC/molH20 in Col4-ER and Ld-erl, respectively). 

The fact that of all 3 erecta mutants examined, erlOS has the most extreme carbon 
20 discrimination and transpiration efficiency phenotypes suggests that the erl05 
mutation affects a more crucisd port of the ERECTA gene than er2 or erlm This is 
consistent with the pubUshed data on the erlOS mutant. This mutation corresponds 
to the insertion of a large "foreign insert" in the ERECTA gene (1200bp). The 
insertion totally inhibits transcription of the gene and causes the strongest erecta 
25 phenotype of all erecta mutants isolated in Col (with respect to inflorescence 
clustering and siUque width and shape. Alternatively, or in addition, data indicate 
that erecta mutations have a stronger effect on carbon isotope discrimination values 
in a Colimibia genetic backgroxmd than in a Landsberg backgroimd (comparison of 
phenot5rpic effects of erlOS and erl), impl5dng that other genes, polymorphic 
30 between Landsberg and Columbia ecotypes, interact with ERECTA in determining 
transpiration efficiency. This could also account for the greater difference in 
transpiration efficiency between er/ER fines in Col background than in a Ld 
background (see above. Table 4). Alternatively, or in addition, data indicate that the 
erecta mutation is not the only mutation present in the erlOS mutant. For example, 
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the mutagenized Col seeds may have carded the gll mutation, induced by the fast 
neutron irradiation, that also contributes to the phenotype observed. 

A comparison of transcript profiles in er/ER isogenic lines (in both Col and Ld 
backgroimd) allows determination of the involvement of additional genes to 
ERECTA and the effect of environment on their expression. 
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TABLE 2 



D a r*1c 0 T*o 1 1 ¥1 ffl 


IVf uf'a ti nn 


Sfnr*1r f^finfTO nanrm 


Stock Centre Name 


Landsberg 


erl 


CS20orNW20'' 


3177orCSl63 










Columbia 


er2^ 


3401 


Coll or 3176 










Columbia 


erl 05 " 




Col3 with gll marker 
or Colo 











a. NW20 is an Ler parent for Lister and Dean's recombinant lines, carr3nlng the erl 



5 mutation. Lines 3177 or CS163 are the closest isogenic ER lines. 

b, er2 is an er allele identified by RSd6i in Col background. Coll or 3176 are the 
closest Col near-isogenic lines. 

c, erlOS was isolated from a fast-neutron-iiradiated Col seed popvdation (Torii et al., 
1996). 

10 d, Col4, the Col p€u:ent for the Lister and Dean's parent was systamically included in 
all comparisons. 
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SEQUENCE LISTING 

5 

<110> The Australian National University 
Masle^ Josette 
10 Farquhar, Graham 

Gilmore, Scott 

15 

<120> Method of producing plants having enhanced transpiration 
efficiency and plants produced therefrom I 

20 

<130> 500592/MRO 



25 <160> 10 



<170> Patentin version 3,1 

30 



<210> 1 
35 <211> 3176 
<212> DNA 

<213> Arabidopsis thaliana ERECTA allele 

40 

<220> 
45 <221> CDS 

<222> (77) . • (3004) 
<223> 

50 



<400> 1 

gtttcttctt catggagact tgaaagcttt taaagtatat ctaaaaacgc agtcgtttta 
55 60 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



agactgtgtg tgagaa atg get ctg ttt aga gat att gtt ctt ctt ggg ttt 
112 

Met Ala Leu Phe Arg Asp lie Val Leu Leu Gly Phe 
15 10 

etc ttc tgc ttg age tta gta get aet gtg act tea gag gag gga gca 
160 

Leu Phe Cys Leu Ser Leu Val Ala Thr Val Thr Ser Glu Glu Gly Ala 
15 20 25 

acg ttg ctg gag att aag aag tea ttc aaa gat gtg aac aat gtt ctt 
208 

Thr Leu Leu Glu lie Lys Lys Ser Phe Lys Asp Val Asn Asn Val Leu 
30 35 40 

tat gae tgg aca act tea cct tct teg gat tat tgt gtc tgg aga ggt 
256 

Tyr Asp Trp Thr Thr Ser Pro Ser Ser Asp Tyr Cys Val Trp Arg Gly 

45 50 55 60 

gtg tct tgt gaa aat gtc acc ttc aat gtt gtt get ctt aat ttg tea 
304 

Val Ser Cys Glu Asn Val Thr Phe Asn Val Val Ala Leu Asn Leu Ser 
65 70 75 

gat ttg aat ctt gat gga gaa ate tea cct get att gga gat etc aag 
352 

Asp Leu Asn Leu Asp Gly Glu lie Ser Pro Ala lie Gly Asp Leu Lys 
80 85 90 

agt etc ttg tea att gat ctg ega ggt aat egc ttg tct gga caa ate 
400 

Ser Leu Leu Ser lie Asp Leu Arg Gly Asn Arg Leu Ser Gly Gin He 
95 100 105 

cct gat gag att ggt gac tgt tct tct ttg caa aac tta gae tta tec 
448 

Pro Asp Glu He Gly Asp Cys Ser Ser Leu Gin Asn Leu Asp Leu Ser 
110 115 120 

ttc aat gaa tta agt ggt gac ata ccg ttt teg att teg aag ttg aag 
496 

Phe Asn Glu Leu Ser Gly Asp He Pro Phe Ser lie Ser Lys Leu Lys 
125 130 135 ^ 

caa ctt gag cag ctg att ctg aag aat aac caa ttg ata gga ccg ate 
544 

Gin Leu Glu Gin Leu He Leu Lys Asn Asn Gin Leu He Gly Pro He 
145 150 155 

592 

Pro Ser Thr Leu Ser Gin He Pro Asn Leu Lys He Leu Asp Leu Ala 
160 165 170 

cag aat aaa etc agt ggt gag ata cca aga ctt att tac tgg aat gaa 
640 ^ 



Gin Asn Lys Leu Ser Gly Glu lie Pro Arg Leu lie Tyr Trp Asn Glu 
175 180 185 

gtt ctt cag tat ctt ggg ttg cga gga aac aac tta gtc ggt aac att 
5 688 

Val Leu Gin Tyr Leu Gly Leu Arg Gly Asn Asn Leu Val Gly Asn lie 
190 195 200 

tct cca gat ttg tgt caa ctg act ggt ctt tgg tat ttt gac gta aga 
10 736 

Ser Pro Asp Leu Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg 
205 210 215 220 

aac aac agt ttg act ggt agt ata cct gag acg ata gga aat tgc act 
15 784 

Asn Asn Ser Leu Thr Gly Ser lie Pro Glu Thr lie Gly Asn Cys Thr 
225 230 235 

gcc ttc cag gtt ttg gac ttg tec tac aat cag eta act ggt gag ate 
20 832 

Ala Phe Gin Val Leu Asp Leu Ser Tyr Asn Gin Leu Thr iSly Glu lie 
240 245 250 

cct ttt gac ate gge ttc ctg caa gtt gea aca tta tea ttg caa ggc 
25 880 

Pro Phe Asp lie Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly 
255 260 265 

aat caa etc tet ggg aag att cca tea gtg att ggt etc atg caa gcc 
30 928 ^ y 

Asn Gin Leu Ser Gly Lys lie Pro Ser Val lie Gly Leu Met Gin Ala 
270 275 280 

ctt gca gtc tta gat eta agt ggc aac ttg ttg agt gga tct att cct 
35 976 

Leu Ala Val Leu Asp Leu Ser Gly Asn Leu Leu Ser Gly Ser lie Pro 

285 290 295 300 

40 1024^*^*^ 

Pro He Leu Gly Asn Leu Thr Phe Thr Glu Lys* Leu Tyr Leu His Ser 
305 310 315 

aac aag ctg act ggt tea att cca cct gag ctt gga aac atg tea aaa 
45 1072 

Asn Lys Leu Thr Gly Ser He Pro Pro Glu Leu Gly Asn Met Ser Lys 
320 325 330 

50 1120*^^^ ^^"^ ""^^ ""^^ "^^^ "^^^ 

Leu His Tyr Leu Glu Leu Asn Asp Asn His Leu Thr Gly His He Pro 
335 340 345 

55 1168^^^ ""^^ ^""^ ^^"^ ^"^^ ^^"^ 

Pro Glu Leu Gly Lys Leu Thr Asp Leu Phe Asp Leu Asn Val Ala Asn 
350 355 360 




4 



aat gat ctg gaa gga cct ata cct gat cat ctg age tct tgc aca aat 
1216 

Asn Asp Leu Glu Gly Pro lie Pro Asp His Leu Ser Ser Cys Thr Asn 

5 365 370 375 380 

eta aae age tta aat gtt eat ggg aac aag ttt agt gge aet ata eee 
1264 

Leu Asn Ser Leu Asn Val His Gly Asn Lys Phe Ser Gly Thr lie Pro 

10 385 390 395 

cga gea ttt eaa aag eta gaa agt atg act tae ett aat ctg tee age 
1312 

Arg Ala Phe Gin Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser 

15 400 405 410 

aac aat ate aaa ggt eca ate ccg gtt gag eta tet cgt ate ggt aae 
1360 

Asn Asn lie Lys Gly Pro lie Pro Val Glu Leu Ser Arg lie Gly Asn 

20 415 420 425 " 

tta gat aea ttg gat ctt tec aac aae aag ata aat gga ate att cct 
1408 

Leu Asp Thr Leu Asp Leu Ser Asn Asn Lys lie Asn Gly lie lie Pro 
25 430 435 440 

tct tee ett ggt gat ttg gag eat ett etc aag atg aac ttg agt aga 
1456 

Ser Ser Leu Gly TVsp Leu Glu His Leu Leu Lys Met Asn Leu Ser Arg 

30 445 450 455 460 

aat cat ata act ggt gta gtt eca gge gae ttt gga aat eta aga age 
1504 

Asn His He Thr Gly Val Val Pro Gly Asp Phe Gly Asn Leu Arg Ser 
35 465 470 475 

ate atg gaa ata gat ett tea aat aat gat ate tet gge eca att cea 
1552 

He Met Glu He Asp Leu Ser Asn Asn Asp He Ser Gly Pro He Pro 
40 480 485 490 

gaa gag ctt aac eaa tta cag aac ata att ttg ctg aga ctg gaa aat 
1600 

Glu Glu Leu Asn Gin Leu Gin Asn He He Leu Leu Arg Leu Glu Asn 
45 495 500 505 

aat aac ctg act ggt aat gtt ggt tea tta gee aac tgt etc agt etc 
1648 

Asn Asn Leu Thr Gly Asn Val Gly Ser Leu Ala Asn Cys Leu Ser Leu 

50 510 515 520 

act gta ttg aat gta tct cat aac aac etc gta ggt gat ate cct aag 
1696 

Thr Val Leu Asn Val Ser His Asn Asn Leu Val Gly Asp He Pro Lys 

55 525 530 535 540 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



aac aat aac btc tea aga ttt tea cca gac age tte att ggc aat cct 
1744 

Asn Asn Asn Phe Ser Arg Phe Ser Pro Asp Ser Phe lie Gly Asn Pro 
545 550 555 

ggt ctt tgc ggt agt tgg eta aac tea ecg tgt eat gat tet cgt cga 
1792 

Gly Leu Cys Gly Ser Trp Leu Asn Ser Pro Cys His Asp Ser Arg Arg 

560 565 570 

act gta cga gtg tea ate tet aga gca get att ctt gga ata get att 
1840 

Thr Val Arg Val Ser lie Ser Arg Ala Ala lie Leu Gly lie Ala lie 
575 580 585 

ggg gga ctt gtg ate ctt etc atg gtc tta ata gca get tgc, cga ecg 
1888 

Gly Gly Leu Val lie Leu Leu Met Val Leu He Ala Ala Cys Arg Pro 
590 595 600 

cat aat cct cct cct ttt ctt gat gga tea ctt gac aaa cca gta act 
1936 

His Asn Pro Pro Pro Phe Leu Asp Gly Ser Leu Asp Lys Pro Val Thr 

605 610 615 620 

tat teg aca ecg aag etc gtc ate ett eat atg aac atg gca etc cac 
1984 

Tyr Ser Thr Pro Lys Leu Val He Leu His Met Asn Met Ala Leu His 

625 630 635 

gtt tac gag gat ate atg aga atg aca gag aat eta agt gag aag tat 
2032 

Val Tyr Glu Asp He Met Arg Met Thr Glu Asn Leu Ser Glu Lys Tyr 
640 645 650 

ate att ggg cae gga gca tea age act gta tae aaa tgt gtt ttg aag 
2080 

He He Gly His Gly Ala Ser Ser Thr Val Tyr Lys Cys Val Leu Lys 
655 660 665 

aat tgt aaa ecg gtt geg att aag egg ett tac tet cac aac cca cag 
2128 

Asn Cys Lys Pro Val Ala He Lys Arg Leu Tyr Ser His Asn Pro Gin 

670 675 680 

tea atg aaa cag ttt gaa aca gaa etc gag atg eta agt age ate aag 
2176 

Ser Met Lys Gin Phe Glu Thr Glu Leu Glu Met Leu Ser Ser He Lys 
685 690 695 700 

cac aga aat ctt gtg age eta caa get tat tec etc tet cac ttg aaa 
2224 

His Arg Asn Leu Val Ser Leu Gin Ala Tyr Ser Leu Ser His Leu Gly 
705 710 715 

agt ctt ctg tte tat gac tat ttg gaa aat ggt age etc tgg gat ctt 

2272 



6 



Ser Leu Leu Phe 
720 

ctt cat ggc cct 
5 2320 

Leu His Gly Pro 
735 

aag ata gca tat: 
10 2368 

Lys lie Ala Tyr 
750 



tgt agt cca agg 
.15 2416 

Cys Ser Pro Arg 
765 

ttg gac aaa gac 
20 2464 

Leu Asp Lys Asp 



age ttg "tgt gtg 
25 2512 

Ser Leu Cys Val 
800 

ata ggt tac ata 
30 2560 

lie Gly Tyr lie 
815 

aaa tec gat gtc 
35 2608 

Lys Ser Asp Val 
830 

cga agg aaa gcc 
40 2656 

Arg Arg Lys 7U.a 
845 



tea aag acg ggg 
45 2704 

Ser Lys Thr Gly 



aca teg aeg tgt 
50 2752 

Thr Ser Thr Cys 
880 

gca etc eta tgc 
55 2800 

Ala Leu Leu Cys 
895 



Tyr Asp Tyr Leu Glu Asn 
725 

acg aag aaa aag act ctt 

Thr Lys Lys Lys Thr Leu 
740 

ggt gca gca caa ggt tta 

Gly Ala Ala Gin Gly Leu 
755 

ate att cac aga gac gtg 

He He His Arg Asp Val 
770 

tta gag get cgt ttg aca 

Leu Glu Ala Arg Leu Thr 
785 790 

tea aag tea cat act tea 

Ser Lys Ser His Thr Ser 
805 

gac cec gag tat get cgc 

Asp Pro Glu Tyr Ala Arg 
820 

tac agt tat gga ata gtc 

Tyr Ser Tyr Gly He Val 
835 

gtt gat gac gaa tec aat 

Val Asp Asp Glu Ser Pisn 
850 

aac aat gaa gtg atg gaa 

Asn Asn Glu Val Met Glu 
865 870 

aaa gat etc ggt gtg gtg 

Lys Asp Leu Gly Val Val 
885 

ace aaa aga cag ccg aat 

Thr Lys Arg Gin Pro Asn 
900 



Gly Ser Leu Trp Asp Leu 
730 

gat tgg gac aca egg ctt 

Asp Trp Asp Thr Arg Leu 
745 

get tat eta cac cat gac 

Ala Tyr Leu His His Asp 
760 

aag teg tec aac att etc 

Lys Ser Ser Asn He Leu 
775 780 

gat ttt gga ata gcg aaa 

Asp Phe Gly He Ala Lys 
795 

act tac gtg atg ggc acg 

Thr Tyr Val Met Gly Thr 
810 

act tea egg etc act gag 

Thr Ser Arg Leu Thr Glu 
825 

ctt ctt gag ttg tta ace 

Leu Leu Glu Leu Leu Thr 
840 

etc cac cat ctg ata atg 

Leu His His Leu He Met 
855 860 

atg gca gat cca gac ate 

Met Ala Asp Pro Asp He 
875 

aag aaa gtt ttc caa ctg 

Lys Lys Val Phe Gin Leu 
890 

gat cga cec aca atg cac 

Asp Arg Pro Thr Met His 
905 



cag gtg act cgt gtt etc ggc agt ttt atg eta teg gaa caa cca ect 
2848 

Gin Val Thr Arg Val Leu Gly Ser Phe Met Leu Ser Glu Gin Pro Pro 
5 910 915 920 

get geg act gae acg tea geg acg etg get ggt teg tgc tac gtc gat 
2896 

Ala /Via Thr Asp Thr Ser Ala Thr Leu Ala Gly Ser Cys Tyr Val Asp 

10 925 930 935 940 

gag tat gca aat etc aag act ect cat tet gtc aat tgc tct tec atg 
2944 

Glu Tyr Ala Asn Leu Lys Thr Pro His Ser Val Asn Cys Ser Ser Met 

15 945 950 955 

agt get tet gat get caa etg ttt ett egg ttt gga caa gtt att tct 
2992 

Ser Ala Ser Asp Ala Gin Leu Phe Leu Arg Phe Gly Gin Val lie Ser 
20 960 965 970 

cag aac agt gag tagtttttcg ttaggaggag aatctttaaa acggtatctt 
3044 

Gin Asn Ser Glu 
25 975 

ttcgttgcgt taagctgtta gaaaaattaa ^tgtctcatgt aaagtattat gcactgcctt 
3104 

30 attattatta gacaagtgtg tggtgtgaat atgtcttcag actggeaett agacttccta 
3164 



35 



40 



45 



taagttcttg cc 
3176 



<210> 2 

<211> 976 

<212> PRT 

<213> TVrabidopsis thaliana ERECTA allele 

<400> 2 



Met Ala Leu Phe Arg Asp lie Val Leu Leu Gly Phe Leu Phe Cys Leu 
50 1 S 10 15 



55 



Ser Leu Val Ala Thr Val Thr Ser Glu Glu Gly Ala Thr Leu Leu Glu 
20 25 30 



lie Lys Lys Ser Phe Lys Asp Val Asn Asn Val Leu Tyr Asp Trp Thr 



8 

35 40 45 



Thr Ser Pro Ser Ser Asp Tyr Cys Val Trp Arg Gly Val Ser Cys Glu 
5 50 55 60 



Asn Val Thr Phe Asn Val Val Ala Leu Asn Leu Ser Asp Leu Asn Leu 
65 70 75 80 

10 

Asp Gly Glu lie Ser Pro Ala lie Gly Asp Leu Lys Ser Leu Leu Ser 
85 90 95 

15 

lie Asp Leu Arg Gly Asn Arg Leu Ser Gly Gin lie Pro Asp Glu lie 
100 105 110 



20 Gly Asp Cys Ser Ser Leu Gin Asn Leu Asp Leu Ser Phe Asn Glu Leu 
115 120 125 



Ser Gly Asp lie Pro Phe Ser lie Ser Lys Leu Lys Gin Leu Glu Gin 
25 130 135 140 



Leu lie Leu Lys Asn Asn Gin Leu lie Gly Pro lie Pro Ser Thr Leu 
145 150 155 160 

30 

Ser Gin lie Pro Asn Leu Lys lie Leu Asp Leu Ala Gin Asn Lys Leu 
165 170 175 

35 

Ser Gly Glu lie Pro Arg Leu He Tyr Trp Asn Glu Val Leu Gin Tyr 
180 185 190 

40 Leu Gly Leu Arg Gly Asn Asn Leu Val Gly Asn He Ser Pro Asp Leu 
195 200 205 

Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Asn Asn Ser Leu 
45 210 215 220 



Thr Gly Ser He Pro Glu Thr He Gly Asn Cys Thr Ala Phe Gin Val 
225 230 235 240 

50 

Leu Asp Leu Ser Tyr Asn Gin Leu Thr Gly Glu He Pro Phe Asp He 
245 250 255 

55 

Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Gin Leu Ser 
260 265 270 



Gly Lys lie Pro Ser Val lie Gly Leu Met Gin Ala Leu Ala Val Leu 
275 280 285 

5 

Asp Leu Ser Gly Asn Leu Leu Ser Gly Ser lie Pro Pro He Leu Gly 
290 295 300 

10 

Asn Leu Thr Phe Thr Glu Lys Leu Tyr Leu His Ser Asn Lys Leu Thr 
305 310 315 320 

15 Gly Ser He Pro Pro Glu Leu Gly Asn Met Ser Lys Leu His Tyr Leu 

325 330 335 

Glu Leu Asn Asp Asn His Leu Thr Gly His He Pro Pro Glu Leu Gly 
20 340 345 3*50 



Lys Leu Thr Asp Leu Phe Asp Leu Asn Val Ala Asn Asn Asp Leu Glu 
355 360 365 

25 

Gly Pro He Pro Asp His Leu Ser Ser Cys Thr Asn Leu Asn Ser Leu 
370 375 380 

30 

Asn Val His Gly Asn Lys Phe Ser Gly Thr He Pro Arg Ala Phe Gin 
385 390 395 400 

35 Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser Asn Asn He Lys 

405 410 415 

Gly Pro He Pro Val Glu Leu Ser Arg He Gly Asn Leu Asp Thr Leu 
40 420 425 430 

Asp Leu Ser Asn Asn Lys He Asn Gly He He Pro Ser Ser Leu Gly 
435 440 445 

45 

/ 

Asp Leu Glu His Leu Leu Lys Met Asn Leu Ser Arg Asn His He Thr 
450 455 460 

50 

Gly Val Val Pro Gly Asp Phe Gly Asn Leu Arg Ser He Met Glu He 

470 475 480 

55 Asp Leu Ser Asn Asn Asp He Ser Gly Pro He Pro Glu Glu Leu Asn 

485 490 495 



10 



Gin Leu Gin Asn lie lie Leu Leu Arg Leu Glu Asn Asn Asn Leu Thr 
500 505 510 

5 

Gly Asn Val Gly Ser Leu Ala Asn Cys Leu Ser Leu Thr Val Leu Asn 
515 520 525 



10 Val Ser His Asn Asn Leu Val Gly Asp lie Pro Lys Asn Asn Asn Phe 
530 535 540 



Ser Arg Phe Ser Pro Asp Ser Phe lie Gly Asn Pro Gly Leu Cys Gly 
15 545 550 555 560 



Ser Trp Leu Asn Ser Pro Cys His Asp Ser Arg Arg Thr Val Arg Val 
565 570 575 

20 

Ser lie Ser Arg Ala Ala lie Leu Gly lie Pila lie Gly Gly Leu Val 
580 585 590 

25 

lie Leu Leu Met Val Leu lie Ala Ala Cys Arg Pro His Asn Pro Pro 
595 600 605 

30 Pro Phe Leu Asp Gly Ser Leu Asp Lys Pro Val Thr Tyr Ser Thr Pro 
610 615 620 



Lys Leu Val lie Leu His Met Asn Met Ala Leu His Val Tyr Glu Asp 
35 625 630 635 640 



lie Met Arg Met Thr Glu Asn Leu Ser Glu Lys Tyr lie lie Gly His 
645 650 655 

40 

Gly Ala Ser Ser Thr Val Tyr Lys Cys Val Leu Lys Asn Cys Lys Pro 
660 665 670 

45 

Val Ala lie Lys Arg Leu Tyr Ser His Asn Pro Gin Ser Met Lys Gin 
675 680 685 

50 Phe Glu Thr Glu Leu Glu Met Leu Ser Ser lie Lys His Arg Asn Leu 
690 695 700 



Val Ser Leu Gin Ala Tyr Ser Leu Ser His Leu Gly Ser Leu Leu Phe 
55 705 710 715 720 



11 

Tyr Asp Tyr Leu Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Pro 
725 730 735 



5 Thr Lys Lys Lys Thr Leu Asp Trp Asp Thr Arg Leu Lys lie Ala Tyr 
740 745 750 



Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Ser Pro Arg 
10 755 760 765 



lie lie His Arg Asp Val Lys Ser Ser Asn lie Leu Leu Asp Lys Asp 
770 775 780 

15 

Leu Glu Ala Arg Leu Thr Asp Phe Gly lie Ala Lys Ser Leu Cys Val 
785 790 795 800 

20 

Ser Lys Ser His Thr Ser Thr Tyr Val Met Gly Thr lie Gly Tyr lie 
805 810 815 



25 Asp Pro Glu Tyr PlLs. Arg Thr Ser Arg Leu Thr Glu Lys Ser Asp Val 
820 825 830 



Tyr Ser Tyr Gly lie Val Leu Leu Glu Leu Leu Thr Arg Arg Lys Ala 
30 835 840 845 



Val Asp Asp Glu Ser Asn Leu His His Leu lie Met Ser Lys Thr Gly 
850 855 860 

35 

Asn Asn Glu Val Met Glu Met Ala Asp Pro Asp lie Thr Ser Thr Cys 
865 870 875 880 

40 

Lys Asp Leu Gly Val Val Lys Lys Val Phe Gin Leu Ala Leu Leu Cys 
885 890 . 895 

45 Thr Lys Arg Gin Pro Asn Asp Arg Pro Thr Met His Gin Val Thr Arg 
900 905 910 

Val Leu Gly Ser Phe Met Leu Ser Glu Gin Pro Pro Ala Ala Thr Asp 
50 915 920 925 

Thr Ser Ala Thr Leu Ala Gly Ser Cys Tyr Val Asp Glu Tyr Ala Asn 
930 935 940 

55 



Leu Lys Thr Pro His Ser Val Asn Cys Ser Ser Met Ser Ala Ser Asp 



12 



945 



950 



955 



960 



Ala Gin Leu Phe Leu Arg Phe Gly Gin Val lie Ser Gin Asn Ser Glu 
5 965 970 975 



<210> 3 

10 <211> 3000 

<212> DNA 

<213> rice ERECTA homolog 

15 

<220> 

20 <221> CDS 

<222> (I),. (2997) 



<223> 



25 



30 



35 



40 



45 



50 



55 



<400> 3 

atg gcg gcg gcg agg gcg 
48 

Met Ala Ala Ala Arg Ala 
1 5 



ccg tgg ctg tgg 
Pro Trp 



gtt ggt gtg gcg 
96 

Val Gly Val Ala 
20 

gat ggg gag ggg 
144 

Asp Gly Glu Gly 
35 

gcg gcc aac gcg 
192 

Ala Ala Asn Ala 
50 



gtg gcg 
Val Ala 

aag gcg 
Lys Ala 



gag gcg 
Glu Ala 



Leu Trp 
10 

gcc tec 

Ala Ser 
25 



tgg tgg 
Trp Trp 



gtg gtg gtg gtt 
Val Val 



ctg atg ggc gtg 
Gly Val 



Leu Met 
40 



gga gga gga gga 

Gly Gly Gly Gly 
30 

aag gcc ggt ttc 
Lys Ala 



etc gtc gac tgg 
Leu Val 



tgg cgc ggc gtc acc tgc 
240 

Trp Arg Gly Val Thr Cys 
65 70 



Asp Trp 
55 

gac aac 
Asp Asn 



gac ggc ggc gcc 

Asp Gly Gly Ala 
60 



Gly Phe 
45 

gac cac 
Asp His 



Val Val 
15 

ggg gga 
Gly Gly 

ggg aac 
Gly Asn 

tgc gcg 
Cys Ala 



gcc tec ttc gcc gtc etc gcc ctg 
Ala Ser 



Phe Ala 
75 



aac ttg tea aat 
288 

Asn Leu Ser Asn 



eta aac eta gga ggt gag ate teg 

lie Ser 



Leu Asn 
85 



Leu Gly Gly Glu 
90 



Val Leu Ala Leu 
80 

ccg gcc ate gga 

Pro Ala lie Gly 
95 



13 



gag etc aag aat eta eag ttc gtt gat cte aag ggg aac aag etc act 
336 

Glu Leu Lys Asn Leu Gin Phe Val Asp Leu Lys Gly Asn Lys Leu Thr 
5 100 105 110 

ggc caa ate cea gat gag att ggg gac tgc ate tec tta aaa tat ttg 
384 

Gly Gin lie Pro Asp Glu lie Gly Asp Cys lie Ser Leu Lys Tyr Leu 
10 115 120 125 

gat ttg tct ggc aac ttg ctg tat gga gac ate ecc ttc tec ate tec 
432 

Asp Leu Ser Gly Asn Leu Leu Tyr Gly Asp lie Pro Phe Ser lie Ser 
15 130 135 140 

aag cte aag eag ett gag gag ctg att ttg aag aac aac eag etc aeg 
480 

Lys Leu Lys Gin Leu Glu Glu Leu lie Leu Lys Asn Asn Gin Leu Thr 
20 145 150 155 160 

gga ccc ate cct tec aca ttg tec caa att cca aat etc aag aca ttg 
528 

Gly Pro lie Pro Ser Thr Leu Ser Gin lie Pro Asn Leu Lys Thr Leu 
25 165 170 175 

gac ctg gca eag aac cag ett aca ggc gat ate cca agg cte ata tac 
576 

Asp Leu Ala Gin Asn Gin Leu Thr Gly Asp lie Pro Arg Leu lie Tyr 
30 180 185 190 

tgg aat gaa gtt ctg caa tac eta ggt ttg agg ggt aac tea ctg act 
624 

Trp Asn Glu Val Leu Gin Tyr Leu Gly Leu Arg Gly Asn Ser Leu Thr 
35 195 200 205 

gga act ttg tea cct gac atg tgc caa ctg act ggc ctg tgg tac ttt 
672 

Gly Thr Leu Ser Pro Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe 
40 210 215 220 

gat gta agg gga aac aat etc aca ggg acc att cca gag age ata ggg 
720 

Asp Val Arg Gly Asn Asn Leu Thr Gly Thr lie Pro Glu Ser lie Gly 

45 225 230 235 240 

aac tgc acc age ttt gag att ctg gac att teg tat aac caa ate tct 
768 

Asn Cys Thr Ser Phe Glu He Leu Asp He Ser Tyr Asn Gin He Ser 
50 245 250 255 

gga ga^ ata cct tac aac ata ggc ttt ett caa gta gee aca ctg tea 
816 

Gly Glu He Pro Tyr Asn He Gly Phe Leu Gin Val Ala Thr Leu Ser 
55 260 265 270 
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55 
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ctt caa gga aat aga cfcg act ggg aaa att cca gat gtg att ggc ctg 
864 

Leu Gin Gly Asn Arg Leu Thr Gly Lys lie Pro Asp Val lie Gly Leu 
275 280 285 

atg caa get ctt get gtt eta gac ctg agt gag aac gag ctg gta ggg 
912 

Met Gin Ala Leu Ala Val Leu Asp Leu Ser Glu Asn Glu Leu Val Gly 
290 295 300 

ccc att cct tct ata ctg ggc aat eta tec tat act gga aaa eta tat 
960 

Pro lie Pro Ser lie Leu Gly Asn Leu Ser Tyr Thr Gly Lys Leu Tyr 

305 310 315 320 

tta cat ggg aac aaa ctt act gga gtc ata ccg ccg gag ctt ggg aac 
1008 

Leu His Gly Asn Lys Leu Thr Gly Val lie Pro Pro Glu Leu Gly Asn 
325 330 335 

atg agt aaa ctt age tac eta caa ctg aat gat aat gaa ttg gtg ggc 
1056 

Met Ser Lys Leu Ser Tyr Leu Gin Leu Asn Asp Asn Glu Leu Val Gly 
340 345 350 

aca att cca gca gag ctt ggc aaa ctt gaa gag ctt ttt gaa eta aat 
1104 

Thr lie Pro Ala Glu Leu Gly Lys Leu Glu Glu Leu Phe Glu Leu Asn 
355 360 365 

ctt gcc aac aac aat ctt caa ggt cct att cct gca aac ate agt tct 
1152 

Leu Ala Asn Asn Asn Leu Gin Gly Pro lie Pro Ala Asn lie Ser Ser 
370 375 380 

tgc act get eta aac aaa ttc aat gtt tat ggc aat aag eta aat ggt 
1200 

Cys Thr Ala Leu Asn Lys Phe Asn Val Tyr Gly Asn Lys Leu Asn Gly 
385 390 395 400 

tct att cct get ggt ttc cag aag ttg gag agt ctg act. tac ttg aac 
1248 

Ser lie Pro Ala Gly Phe Gin Lys Leu Glu Ser Leu Thr Tyr Leu Asn 
405 410 415 

eta tct tea aac aat ttc aaa ggc aat att cct tct gag ctt ggt cac 
1296 

Leu Ser Ser Asn Asn Phe Lys Gly Asn lie Pro Ser Glu Leu Gly His 
420 425 430 

ate ate aac ttg gac aca ttg gat ctt tee tac aat gaa ttc tct gga 
1344 

lie lie Asn Leu Asp Thr Leu Asp Leu Ser Tyr Asn Glu Phe Ser Gly 
435 440 445 

cca gtt cct get ace att ggt gat eta gag cac ctt ctt gaa ctg aat 
1392 



15 

Pro Val .Pro Ala Thr lie Gly Asp Leu Glu His Leu Leu Glu Leu Asn 
450 455 460 

ttg agt aag aac cat ctt gat ggg cca gtt cct get gag ttt gga aac 
5 1440 

Leu Ser Lys Asn His Leu Asp Gly Pro Val Pro Ala Glu Phe Gly Asn 

465 470 475 480 

ttg aga age gtc caa gta att gat atg tec aac aac aac tta tct ggt 
10 1488 

Leu Arg Ser Val Gin Val lie Asp Met Ser Asn Asn Asn Leu Ser Gly 
485 490 495 

agt ctg ccc gag gaa ctt gga caa ctt caa aac ctt gat age etg att 
15 1536 

Ser Leu Pro Glu Glu Leu Gly Gin Leu Gin Asn Leu Asp Ser Leu lie 

500 505 510 

ctt aac aac aac aat ttg gtt ggg gag ate cct get caa ttg gee aac 
20 1584 

Leu Asn Asn Asn Asn Leu Val Gly Glu lie Pro Ala Gin Leu Ala Asn 

515 520 525 

tgc ttc age tta aat aac ctt gca ttt cag gaa ttt gtc ata caa caa 
25 1632 

Cys Phe Ser Leu Asn Asn Leu Ala Phe Gin Glu Phe Val lie Gin Gin 
530 535 540 

ttt ate tgg aca tgt ccc gat ggc aaa gaa ctt etc gaa att ccc aat 
30 1680 

Phe lie Trp Thr Cys Pro Asp Gly Lys Glu Leu Leu Glu lie Pro Asn 

545 550 555 560 

gga aag cat ctt eta att tct gat tgc aac cag tac ata aat cat aaa 
35 1728 

Gly Lys His Leu Leu lie Ser Asp Cys Asn Gin Tyr lie Asn His Lys 
565 570 575 

tgc age ttc ttg ggt aat cca tta ctg cat gtt tac tgc caa gat tec 
40 1776 

Cys Ser Phe Leu Gly Asn Pro Leu Leu His Val Tyr Cys Gin Asp Ser 
580 585 590 

age tgt gga cac tct cat gga caa aga gtt aat att tea aag aca gca 
45 1824 

Ser Cys Gly His Ser His Gly Gin Arg Val Asn lie Ser Lys Thr Ala 
595 600 605 

att get tgc att ate tta ggc ttt ate ata ttg etc tgc gtt etg etg 
50 1872 

He Ala Cys He He Leu Gly Phe He He Leu Leu Cys Val Leu Leu 
610 615 620 

ttg get ata tat aaa aca aat caa cca cag cca ctt gtc aaa gga tec 
55 1920 

Leu Ala He Tyr Lys Thr Asn Gin Pro Gin Pro Leu Val Lys Gly Ser 

625 630 635 640 



16 



gat aag cca gtg caa gga cct cca aag eta gtt gtt etc eag atg gae 
1968 

Asp Lys Pro Val Gin Gly Pro Pro Lys Leu Val Val Leu Gin Met Asp 
5 645 650 655 

atg get ate eat act tac gag gae ate atg agg etg aea gag aat ttg 
2016 

Met Ala lie His Thr Tyr Glu Asp lie Met Arg Leu Thr Glu Asn Leu 

10 660 665 670 

age gag aaa tac ate att ggc tat gge gee tea age act gte tac aaa 
2064 

Ser Glu Lys Tyr lie lie Gly Tyr Gly Ala Ser Ser Thr Val Tyr Lys 

15 675 680 685 

tgt gaa etc aag age gge aag gee att get gte aag egg ett tac agt 
2112 

Cys Glu Leu Lys Ser Gly Lys Ala lie Ala Val Lys Arg Leu Tyr Ser 
20 690 695 700 

cag tat aac eat age etc cga gag ttt gaa aea gaa eta gag aca att 
2160 

Gin Tyr Asn His Ser Leu Arg Glu Phe Glu Thr Glu Leu Glu Thr lie 

25 705 710 715 720 

ggc age ata egg eac agg aat ett gtt age etc cat ggc ttc teg eta 
2208 

Gly Ser lie Arg His Arg Asn Leu Val Ser Leu His Gly Phe Ser Leu 
30 725 730 735 

tet cca cat gga aac ttg etc ttc tat gat tac atg gaa aat ggt tec 
2256 

Ser Pro His Gly Asn Leu Leu Phe Tyr Asp Tyr Met Glu Asn Gly Ser 
35 740 745 750 

ttg tgg gat ett etc eac ggt cca tea aag aaa gtg aag etc aac tgg 
2304 

Leu Trp Asp Leu Leu His Gly Pro Ser Lys Ijys Val Lys Leu Asn Trp 

40 755 760 765 

gae aca aga ctg agg ate gcg gte gga get gca caa ggg etg gee tat 
2352 

Asp Thr Arg Leu Arg He Ala Val Gly Ala Ala Gin Gly Leu Ala Tyr 
45 770 775 780 

etc eac cat gae tgc aac cct cgc ata ate eac aga gat gte aag tec 
2400 

Leu His His Asp Cys Asn Pro Arg He He His Arg Asp Val Lys Ser 

50 785 790 795 800 

tec aac ate etg etc gae gag aac ttc gaa gcg cac etc tea gat ttc 
2448 

Ser Asn He Leu Leu Asp Glu Asn Phe Glu Ala His Leu Ser Asp Phe 
55 805 810 815 
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ggc ata gcc aaa tgt gtc ccc tct gcc aag tec cat gee tec act tat 
2496 

Gly lie Ala Lys Cys Val Pro Ser Ala Lys Ser His Ala Ser Thr Tyr 
820 . 825 830 

gtg eta gga acc ate gge tac att gat ccg gag tat gcc agg act tec 
2544 

Val Leu Gly Thr He Gly Tyr He Asp Pro Glu Tyr Ala Arg Thr Ser 
835 840 845 

agg etc aat gag aaa tct gat gtg tac age tte ggc ate gtc ctt ctg 
2592 

Arg Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly He Val Leu Leu 
850 855 860 

gaa ttg etc aca ggg aag aag gee gtc gae aac gaa teg aac ttg cat 
2640 

Glu Leu Leu Thr Gly Lys Lys Ala Val Asp Asn Glu Ser Asn Leu His 

865 870 875 880 

caa ttg ata etc tee aaa get gat gae aac aca gtc atg gag gca gtg 
2688 

Gin Leu He Leu Ser Lys Ala Asp Asp Asn Thr Val Met Glu Ala Val 
885 890 895 

gae teg gag gtg tea gtg aeg tgc acg gae atg gga ctg gtc agg aag 
2736 

Asp Ser Glu Val Ser Val Thr Cys Thr Asp Met Gly Leu Val Arg Lys 
900 905 910 

gcc ttc cag etc gcc ctt ctg tgc acc aag agg cac cet tea gae egg 
2784 

Ala Phe Gin Leu Ala Leu Leu Cys Thr Lys Arg His Pro Ser Asp Arg 
915 920 925 

ccg acc atg cac gag gtt gca agg gtg ctg etc tec ctg ctg ccg gcc 
2832 

Pro Thr Met His Glu Val Ala Arg Val Leu Leu Ser Leu Leu Pro Ala 
930 935 940 

tec gcc atg aca acg ccc aag acg gtg gae tac tec egg ttg ctg gcg 
2880 

Ser Ala Met Thr Thr Pro Lys Thr Val Asp Tyr Ser Arg Leu Leu 7\la 

945 950 955 960 

teg aeg acg acg gcg gcc gae atg cga ggg cac gae gtg acc gae ate 
2928 

Ser Thr Thr Thr Ala Ala Asp Met Arg Gly His Asp Val Thr »Asp He 
965 970 975 

ggc gae aac age tec tec gae gag cag tgg ttc gtc agg ttc ggc gag 
2976 

Gly Asp Asn Ser Ser Ser Asp Glu Gin Trp Phe Val Arg Phe Gly Glu 
980 985 990 

gtc ata tec aag cac aca atg tga 
3000 



18 



Val lie Ser Lys His Thr Met 
995 



5 <210> 4 
<211> 999 
<212> PRT 

10 

<213> rice ERECTA homolog 
15 <400> 4 

Met Ala Ala Ala Arg Ala Pro Trp Leu Trp Trp Trp Val Val Val Val 
15 10 15 

20 

Val Gly Val Ala Val Ala Glu Ala Ala Ser Gly Gly Gly Gly Gly Gly 
20 25 30 



25 Asp Gly Glu Gly Lys Ala Leu Met Gly Val Lys Ala Gly Phe Gly Asn 
35 40 45 



Ala Ala Asn Ala Leu Val Asp Trp Asp Gly Gly Ala Asp His Cys Ala 
30 50 55 60 



Trp Arg Gly Val Thr Cys Asp Asn Ala Ser Phe Ala Val Leu Ala Leu 
65 70 75 80 

35 

Asn Leu Ser Asn Leu Asn Leu Gly Gly Glu lie Ser Pro Ala lie Gly 
85 90 95 

40 

Glu Leu Lys Asn Leu Gin Phe Val Asp Leu Lys Gly Asn Lys Leu Thr 
100 105 110 

45 Gly Gin lie Pro Asp Glu lie Gly Asp Cys lie Ser Leu Lys Tyr Leu 
115 120 125 

Asp Leu Ser Gly Asn Leu Leu Tyr Gly Asp lie Pro Phe Ser He Ser 
50 130 135 140 

Lys Leu Lys Gin Leu Glu Glu Leu He Leu Lys Asn Asn Gin Leu Thr 
145 150 155 160 

55 



Gly Pro He Pro Ser Thr Leu Ser Gin He Pro Asn Leu Lys Thr Leu 



19 

165 170 175 



Asp Leu Ala Gin Asn Gin Leu Thr Gly Asp lie Pro Arg Leu lie Tyr 
5 180 185 190 



Trp Asn Glu Val Leu Gin Tyr Leu Gly Leu Arg Gly Asn Ser Leu Thr 
195 200 205 

10 

Gly Thr Leu Ser Pro Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe 
210 215 220 

15 

Asp Val Arg Gly Asn Asn Leu Thr Gly Thr lie Pro Glu Ser lie Gly 
225 230 235 240 



20 Asn Cys Thr ,Ser Phe Glu lie Leu Asp lie Ser Tyr Asn Gin lie Ser 

245 250 255 



Gly Glu lie Pro Tyr Asn lie Gly Phe Leu Gin Val Ala Thr Leu Ser 
25 260 265 270 



Leu Gin Gly Asn Arg Leu Thr Gly Lys lie Pro Asp Val He Gly Leu 
275 280 285 

30 

Met Gin Ala Leu Ala Val Leu Asp Leu Ser Glu Asn Glu Leu Val Gly 
290 295 300 

35 

Pro He Pro Ser He Leu Gly Asn Leu Ser Tyr Thr Gly Lys Leu Tyr 
305 310 315 320 

40 Leu His Gly Asn Lys Leu Thr Gly Val He Pro Pro Glu Leu Gly Asn 

325 330 335 

Met Ser Lys Leu Ser Tyr Leu Gin Leu Asn Asp Asn Glu Leu Val Gly 
45 340 345 350 

Thr He Pro Ala Glu Leu Gly Lys Leu Glu Glu Leu Phe Glu Leu Asn 
355 360 365 

50 

Leu Ala Asn Asn Asn Leu Gin Gly Pro He Pro Ala Asn He Ser Ser 
370 375 380 

55 

Cys Thr Ala Leu Asn Lys Phe Asn Val Tyr Gly Asn Lys Leu Asn Gly 
385 390 395 400 



20 



Ser lie Pro Ala Gly .Phe Gin Lys Leu Glu Ser Leu Thr Tyr Leu Asn 
405 410 415 

5 

Leu Ser Ser Asn Asn Phe Lys Gly Asn lie Pro Ser Glu Leu Gly His 
420 425 430 

10 

lie lie Asn Leu Asp Thr Leu Asp Leu Ser Tyr Asn Glu Phe Ser Gly 
435 440 445 



15 Pro Val Pro Ala Thr lie Gly Asp Leu Glu His Leu Leu Glu Leu Asn 
450 455 460 



Leu Ser Lys Asn His Leu Asp Gly Pro Val Pro Ala Glu Phe Gly Asn 
20 465 470 475 480 



Leu Arg Ser Val Gin Val lie Asp Met Ser Asn Asn Asn Leu Ser Gly 
485 490 495 

25 

Ser Leu Pro Glu Glu Leu Gly Gin Leu Gin Asn Leu Asp Ser Leu lie 
500 505 510 

30 

Leu Asn Asn Asn Asn Leu Val Gly Glu lie Pro Ala Gin Leu Ala Asn 
515 520 525 



35 Cys Phe Ser Leu Asn Asn Leu Ala Phe Gin Glu Phe Val lie Gin Gin 
530 535 540 



Phe lie Trp Thr Cys Pro Asp Gly Lys Glu Leu Leu Glu lie Pro Asn 
40 545 550 555 560 



Gly Lys His Leu Leu He Ser Asp Cys Asn Gin Tyr He Asn His Lys 
565 570 575 

45 

Cys Ser Phe Leu Gly Asn Pro Leu Leu His Val Tyr Cys Gin Asp Ser 
580 585 590 

50 

Ser Cys Gly His Ser His Gly Gin Arg Val Asn He Ser Lys Thr Ala 
595 600 605 



55 He Ala Cys He He Leu Gly Phe He He Leu Leu Cys Val Leu Leu 
610 615 620 



21 



Leu Ala lie Tyr Lys Thr Asn Gin Pro Gin Pro Leu Val Lys Gly Ser 
625 630 635 640 

5 

Asp Lys Pro Val Gin Gly Pro Pro Lys Leu Val Val Leu Gin Met Asp 
645 650 655 



10 Met Ala lie His Thr Tyr Glu Asp lie Met Arg Leu Thr Glu Asn Leu 
660 665 670 



Ser Glu Lys Tyr lie lie Gly Tyr Gly Ala Ser Ser Thr Val Tyr Lys 
15 675 680 685 ■ 



Cys Glu Leu Lys Ser Gly Lys Ala lie Ala Val Lys Arg Leu Tyr Ser 
690 695 700 

20 

Gin Tyr Asn His Ser Leu Arg Glu Phe Glu thr Glu Leu Glu Thr He 
705 710 715 720 

25 

Gly Ser He Arg His Arg Asn Leu Val Ser Leu His Gly Phe Ser Leu 
725 730 735 



30 Ser Pro His Gly Asn Leu Leu Phe Tyr Asp Tyr Met Glu Asn Gly Ser 
740 745 750 



Leu Trp Asp Leu Leu His Gly Pro Ser Lys Lys Val Lys Leu Asn Trp 
35 755 760 765 



Asp Thr Arg Leu Arg He Ala Val Gly Ala Ala Gin Gly Leu Ala Tyr 
770 775 780 

40 

Leu His His Asp Cys Asn Pro Arg He He His Arg Asp Val Lys Ser 
785 790 795 800 

45 

Ser Asn He Leu Leu Asp Glu Asn Phe Glu Ala His Leu Ser Asp Phe 
805 810 815 

50 Gly He Ala Lys Cys Val Pro Ser Ala Lys Ser His Ala Ser Thr Tyr 
820 825 830 



Val Leu Gly Thr He Gly Tyr He Asp Pro Glu Tyr Ala Arg Thr Ser 
55 835 840 845 



22 

Arg Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly lie Val Leu Leu 
850 855 860 



5 Glu Leu Leu Thr Gly Lys Lys Ala Val Asp Asn Glu Ser Asn Leu His 
865 870 875 880 



Gin Leu lie Leu Ser Lys Ala Asp Asp Asn Thr Val Met Glu Ala Val 
10 885 890 895 



Asp Ser Glu Val Ser Val Thr Cys Thr Asp Met Gly Leu Val Arg Lys 
900 905 910 • 

Ala Phe Gin Leu Ala Leu Leu Cys Thr Lys Arg His Pro Ser Asp Arg 
915 920 925 

20 

Pro Thr Met His Glu Val Ala Arg Val Leu Leu Ser Leu Leu Pro Ala 
930 935 940 



25 Ser Ala Met Thr Thr Pro Lys Thr Val Asp Tyr Ser Arg Leu Leu Ala 
945 950 955 960 



Ser Thr Thr Thr Ala Ala Asp Met Arg Gly His Asp Val Thr Asp lie 
30 965 970 975 



Gly Asp Asn Ser Ser Ser Asp Glu Gin Trp Phe Val Arg Phe Gly Glu 
980 985 990 

35 

Val lie Ser Lys His Thr Met 
995 

40 

<210> 5 
<211> 2766 
45 <212> DNA 

<213> sorghum ERECTA homolog 

50 

<220> 

<221> CDS 
55 <222> (1)..(2763) 
<223> 
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<400> 5 

5 atg acg acg acg gcc gcc cgt get etc gtc gcc etc etc etc gtc gee 
48 

Met Thr Thr Thr Ala Ala Arg Ala Leu Val Ala Leu Leu Leu Val Ala 
15 10 15 

10 gtc gcc gtc gcc gac gat ggg gcg acg ctg gtg gag ate aag aag tec 
96 

Val Ala Val Ala Asp Asp Gly Ala Thr Leu Val Glu lie Lys Lys Ser 
20 25 30 



15 


ttc 


cgc 


aac 


gtc ggc 


aac 


gta 


ctg 


tac 


gat tgg gcc 


ggc 


gac 


gac 


tac 




144 






























Phe 


Arg 


Asn 


Val 


Gly 


Asn 


Val 


Leu 


Tyr 


Asp Trp Ala 


Gly 


Asp 


Asp 


Tyr 








35 










40 






45 








20 


tgc 


tec 


tgg 


cgc 


ggc 


gtc 


ctg 


tgc 


gac 


aac gtc aca 


ttc 


gee 


gtc 


get 




192 






























Cys 


Ser 


Trp 


Arg 


Gly 


Val 


Leu 


Cys 


Asp 


Asn Val Thr 


Phe 


Ala 


Val 


Ala 






50 










55 






60 










25 


gcg 


etc 


aac 


etc 


tet 


ggc 


etc 


aac 


ett 


gag ggc gag 


ate 


tet 


cea 


gcc 




240 






























Ala 


Leu 


TVsn 


Leu 


Ser 


Gly Leu Asn 


Leu 


Glu Gly Glu 


lie 


Ser 


Pro 


Ala 




65 










70 








75 








80 


30 


gtc 


ggc 


age 


etc 


aag 


age 


etc 


gtc 


tee 


ate gat ctg 


aag 


tea 


aat 


ggg 




288 






























Val 


Gly 


Ser 


Leu 


Lys 


Ser 


Leu 


Val 


Ser 


lie Asp Leu 


Lys 


Ser 


Asn Gly 












85 










90 






95 




35 


eta 


tec 


ggg 


cag 


ate 


cct 


gat 


gag 


att 


ggt gat tgt 


tea 


tea 


ctt 


agg 



336 

Leu Ser Gly Gin lie Pro Asp Glu lie Gly Asp Cys Ser Ser Leu Arg 
100 105 110 

40 acg ctg gac ttt tet ttc aac aac ttg gat ggc gac ata cea ttt tet 
384 

Thr Leu Asp Phe Ser Phe Asn Asn Leu Asp Gly Asp He Pro Phe Ser 

115 120 125 

45 ata tea aag ctg aag cae ctg gag aac ttg ata ttg aag aac aac cag 
432 

He Ser Lys Leu Lys His Leu Glu Asn Leu He Leu Lys Asn Asn Gin 
130 135 140 

50 ctg att ggt gcg ate cea tea aca ttg tea cag etc cea aat ttg aag 
480 

Leu He Gly Ala He Pro Ser Thr Leu Ser Gin Leu Pro Asn Leu Lys 

145 150 155 160 

55 att ttg gat ttg gea caa aac aaa ctg act ggg gag ata cea agg ett 
528 

He Leu Asp Leu Ala Gin Asn Lys Leu Thr Gly Glu He Pro Arg Leu 



165 

ate tac tgg 
576 

5 lie Tyr Trp 



titg acc ggg 
624 

10 Leu Thr Gly 

195 

gtc ttg gat 
672 

15 Val Leu Asp 
210 

att ggt ttc 
720 

20 lie Gly Phe 
225 

acc ggt cca 
768 

25 Thr Gly Pro 



eta gat ctg 
816 

30 Leu Asp Leu 



ggc aac ttg 
864 

35 Gly Asn Leu 
275 

act ggg teg 

912 

40 Thr Gly Ser 
290 

eta gaa ctg 
960 

45 Leu Glu Leu 
305 

gga agg eta 
1008 
50 Gly Arg Leu 



gaa gga cca 
1056 
55 Glu Gly Pro 




170 

aat gag gtt ctt caa 

Asn Glu Val Leu Gin 
180 

gtg ata cca gac ace 

Val lie Pro Asp Thr 
200 

ttg tet tac aac egc 

Leu Ser Tyr Pisn Arg 
215 

eta caa gtg get aca 

Leu Gin Val Ala Thr 
230 

att cet tea gta att 

lie Pro Ser Val lie 
245 

agt tac aac caa tta 

Ser Tyr Asn Gin Leu 
260 

aca tac act gag aag 

Thr Tyr Thr Glu Lys 
280 

ata cca cca gag tta 

lie Pro Pro Glu Leu 
295 

aac gat aat caa ctt 

Asn Asp Asn Gin Leu 
310 

aca ggc ttg ttt gac 

Thr Gly Leu Phe Asp 
325 

att cct gac aac eta 

lie Pro Asp Asn Leu 
340 



24 

175 

tat ctt gat gtg 

Tyr Leu Asp Val 
185 

att ggg aac tgt 
lie Gly Asn Cys 

ttt act gga cca 

Phe Thr Gly Pro 
220 

eta tec ttg caa 

Leu Ser Leu Gin 
235 

ggt ctt atg cag 

Gly Leu Met Gin 
250 

tct ggt cct ata 

Ser Gly Pro lie 
265 

ctg tac ate caa 

Leu Tyr lie Gin 

gga aat atg tea 

Gly Asn Met Ser 
300 

act ggg tea att 

Thr Gly Ser lie 
315 

ctg aac ctt gcg 

Leu Asn Leu Ala 
330 

agt tea tgt gtg 

Ser Ser Cys Val 
345 




aag aac aat age 

Lys Asn Asn Ser 
190 

aca agt ttt caa 

Thr Ser Phe Gin 
205 

ate cca ttc aac 
lie Pro Phe Asn 

ggg aac aag ttc 

Gly Pksn Lys Phe 
240 

get etc get gtt 

Ala Leu Ala Val 
255 

cca tea ata eta 

Pro Ser lie Leu 
270 

ggc aat aag tta 

Gly Asn Lys Leu 
285 

aca ctt cat tac 

Thr Leu His Tyr 

cca cca gag ctt 

Pro Pro Glu Leu 
320 

aat aac cac ctg 

Asn Asn His Leu 
335 

aat etc aat age 

Asn Leu Asn Ser 
350 
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25 

ttc aat get tat ggc aac aag tta aat ggg acc att cct cgt teg ttg 
1104 

Phe Asn Ala Tyr Gly Asn Lys Leu Asn Gly Thr lie Pro Arg Ser Leu 

355 360 365 

egg aaa ctt gaa age atg acc tat tta aat ctg tea tea aae tte ata 
1152 

Arg Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser Ser Asn Phe lie 
370 375 380 

Q^gt ggc tct att cct att gag tta tea agg ate aac aat ttg gae aeg 
1200 

Ser Gly Ser lie Pro lie Glu Leu Ser Arg lie Asn Asn Leu Asp Thr 
385 390 395 400 

ctg gat tta tec tgt aac atg atg act ggt cca att cca tea tea att 
1248 

Leu Asp Leu Ser Cys Asn Met Met Thr Gly Pro lie Pro Ser Ser lie 
405 410 415 

ggc age eta gag eat eta ttg aga ctt aae ttg age aag aat ggt eta 
1296 

Gly Ser Leu Glu His Leu Leu Arg Leu Asn Leu Ser Lys Asn Gly Leu 
420 425 430 

gtt gga ttc ate cce geg gag ttt ggt aat ttg agg agt gtc atg gag 
1344 

Val Gly Phe He Pro Ala Glu Phe Gly Asn Leu Arg Ser Val Met Glu 
435 440 445 

att gat tta tee tat aat cae ctt ggt ggc ctg att cct caa gaa ctt 
1392 

He Asp Leu Ser Tyr Asn His Leu Gly Gly Leu He Pro Gin Glu Leu 
450 455 460 

gaa atg ctg caa aac ctg atg ttg eta aat gtg teg tac aat aat ttg 
1440 

Glu Met Leu Gin Asn Leu Met Leu Leu Asn Val Ser Tyr Asn Asn Leu 
465 470 475 480 

get ggt gtt gtc cct get gac aac aac ttc aca cgcf ttt tea cct gae 
1488 

Ala Gly Val Val Pro Ala Asp Asn Asn Phe Thr Arg Phe Ser Pro Asp 

485 490 495 

age ttt tta ggt aat cct gga etc tgt gga tac tgg ctt ggt teg teg 
1536 

Ser Phe Leu Gly Asn Pro Gly Leu Cys Gly Tyr Trp Leu Gly Ser Ser 
500 505 510 

tgt cgt tec act ggc cac cac gag aaa ccg cct ate tea aaa act acc 
1584 

Cys Arg Ser Thr Gly His His Glu Lys Pro Pro He Ser Lys Ala Ala 
515 520 525 

ata att ggt gtt get gtg ggt gga ctt gtt ate etc ttg atg ate tta 
1632 
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He He Gly Val Ala Val Gly Gly Leu Val He Leu Leu Met He Leu 
530 535 540 

gta get gtt tgc agg cca cat cgt cca cct get ttt aaa gat gtc act 
5 1680 

Val Ala Val Cys Arg Pro His Arg Pro Pro Ala Phe Lys Asp Val Thr 

545 550 555 560 

gta age aag cca gtg aga aat get ccc ecc aag ctg gtg ate ctt cat 
10 1728 

Val Ser Lys Pro Val Arg Asn Ala Pro Pro Lys Leu Val He Leu His 
565 570 575 

atg aac atg gcc ctt cat gta tac gat gac ata atg agg atg act gag 
15 1776 

Met Asn Met Ala Leu His Val Tyr Asp Asp He Met Arg Met Thr Glu 
580 585 590 

aac ttg agt gag aaa tac ate att gga tac ggg gcg tea agt aca gtt 
20 1824 

Asn Leu Ser Glu Lys Tyr He lie Gly Tyr Gly Ala Ser Ser Thr Val 
595 600 605 

tat aaa tgt gtc eta aag aat tgc aaa ccg gtg gea ata aaa aag ctg 
25 1872 

Tyr Lys Cys Val Leu Lys Asn Cys Lys Pro Val Ala He Lys Lys Leu 
610 615 620 

tat gee eac tac cca cag age ctt, aag gaa ttt gaa act gag ctt gag 
30 1920 

Tyr Ala His Tyr Pro Gin Ser Leu Lys Glu Phe Glu Thr Glu Leu Glu 

625 630 635 640 

act gtt ggt age ate aag eac egg aat eta gtc age ctt caa ggg tac 
35 1968 

Thr Val Gly Ser He Lys His Arg Asn Leu Val Ser Leu Gin Gly Tyr 
645 650 655 

tea tta tea cct gtt ggg aac etc etc ttt tat gat tat atg gaa tgt 
40 2016 

Ser Leu Ser Pro Val Gly Asn Leu Leu Phe Tyr Asp Tyr Met Glu Cys 
660 665 670 

ggc age tta tgg gat gtt tta cat gaa ggt tea tec aag aag aaa aaa 
45 2064 

Gly Ser Leu Trp Asp Val Leu His Glu Gly Ser Ser Lys Lys Lys Lys 
675 680 685 

ctt gac tgg gag act cgc eta egg att get ctt ggt gea get caa ggc 
50 2112 

Leu Asp Trp Glu Thr Arg Leu Arg He Ala Leu Gly Ala Ala Gin Gly 
690 695 700 

ctt get tac ctt eac cat gac tgc agt cca egg ata att cat egg gat 
55 2160 

Leu Ala Tyr Leu His His Asp Cys Ser Pro Arg He He His Arg Asp 
705 710 715 720 
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gta aaa tea aag aat ata etc ctt gac aaa gat tat gag gcc cat ctt 

2208 

Val Lys Ser Lys Asn lie Leu Leu Asp Lys Asp Tyr Glu Ala His Leu 
5 725 730 735 

aca gac ttt gga att get aag age tta tgt gtc tea aaa act cae aca 
2256 

Thr Asp Phe Gly lie Ala Lys Ser Leu Cys Val Ser Lys Thr His Thr 
10 740 745 750 

tea ace tat gtc atg gga act att ggc tac att gat cct gag tac gcc 
2304 

Ser Thr Tyr Val Met Gly Thr lie Gly Tyr lie Asp Pro Glu Tyr TVla 
15 755 760 765 

cgc act tec cgt etc aac gaa aag tet gat gtc tac agg eta tgg cat 
2352 

Arg Thr Ser TVrg Leu Asn Glu Lys Ser Asp Val Tyr Arg Leu Trp His 
20 770 775 780 

tgt tet get gga get get gac tgg caa gaa gcc agt gga caa cga ate 
2400 

Cys Ser Ala Gly Ala TVla Asp Trp Gin Glu Ala Ser Gly Gin Arg lie 

25 785 790 795 800 

eta teg aag aeg gea age aac gag gtc atg gat ace gtg gac cct gac 
2448 

Leu Ser Lys Thr Ala Ser Asn Glu Val Met Asp Thr Val TVsp Pro Asp 
30 805 810 815 

ate ggg gac ace tgc aag gac etc ggc gag gtg aag aag ctg ttc cag 

2496 

lie Gly Asp Thr Cys Lys Asp Leu Gly Glu Val Lys Lys Leu Phe Gin 
35 820 825 830 

ctg gcg etc ctt tgc ace aag egg caa cec teg gac cga ccg acg atg 
2544 

Leu Ala Leu Leu Cys Thr Lys Arg Gin Pro Ser Asp Arg Pro Thr Met 

40 835 840 845 

cae gag gtg gtg cgc gtc ctg gac tgc ctg gtg aac ccg gac ccg ccg 
2592 

His Glu Val Val Arg Val Leu Asp Cys Leu Val Asn Pro Asp Pro Pro 
45 850 855 860 

cca aag ccg teg gcg cae cag ctg ccg cag ccg teg eca gcc gtg cca 
2640 

Pro Lys Pro Ser Ala His Gin Leu Pro Gin Pro Ser Pro Ala Val Pro 

50 865 870 875 880 

age tac ate aac gag tac gtc age ctg egg ggc ace ggc get etc tee 
2688 

Ser Tyr lie Asn Glu Tyr Val Ser Leu Arg Gly Thr Gly Ala Leu Ser 
55 885 890 895 
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tgc gcc aac teg acc age acc teg gae gee gag etg tte etc aag tte 
2736 

Cys Ala Asn Ser Thr Ser Thr Ser Asp Ala Glu Leu Phe Leu Lys Phe 
900 905 910 

5 

ggc gag gcc ate teg cag aac atg gag tag 
2766 

Gly Glu Ala lie Ser Gin Asn Met Glu 
915 920 

10 

<210> 6 
<211> 921 

15 

<212> PRT 

<213> sorghum ERECTA homolog 

20 

<400> 6 

Met Thr Thr Thr Ala Ala Arg Ala Leu Val Ala Leu Leu Leu Val Ala 
25 1 5 10 15 



Val Ala Val Ala Asp Asp Gly Ala Thr Leu Val Glu lie Lys Lys Ser 
20 25 30 

30 

Phe Arg Asn Val Gly Asn Val Leu Tyr Asp Trp Ala Gly Asp Asp Tyr 
35 40 45 

35 

Cys Ser Trp Arg Gly Val Leu Cys Asp Asn Val Thr Phe Ala Val Ala 
50 55 60 

40 Ala Leu Asn Leu Ser Gly Leu Asn Leu Glu Gly Glu lie Ser Pro Ala 
65 70 75 80 

Val Gly Ser Leu Lys Ser Leu Val Ser lie Asp Leu Lys Ser Asn Gly 
45 85 90 95 



Leu Ser Gly Gin lie Pro Asp Glu lie Gly Asp Cys Ser Ser Leu Arg 
100 105 110 

50 

Thr Leu Asp Phe Ser Phe Asn Asn Leu Asp Gly Asp lie Pro Phe Ser 
115 120 125 

55 

lie Ser Lys Leu Lys His Leu Glu Asn Leu lie Leu Lys Asn Asn Gin 
130 135 140 
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Leu lie Gly Ala lie Pro Ser Thr Leu Ser Gin Leu Pro Asn Leu Lys 
145 150 155 160 

5 

lie Leu Asp Leu Ala Gin Asn Lys Leu Thr Gly Glu lie Pro Arg Leu 
165 170 175 

10 

He Tyr Trp Asn Glu Val Leu Gin Tyr Leu Asp Val Lys Asn Asn Ser 
180 185 190 



15 Leu Thr Gly Val He Pro Asp Thr He Gly Asn Cys Thr Ser Phe Gin 
195 200 205 • 



Val Leu Asp Leu Ser Tyr Asn Arg Phe Thr Gly Pro He Pro Phe Asn 
20 210 215 220 



He Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Lys Phe 
225 230 235 240 

25 

Thr Gly Pro He Pro Ser Val He Gly Leu Met Gin T^a Leu TVla Val 
245 250 255 

30 

Leu Asp Leu Ser Tyr Asn Gin Leu Ser Gly Pro He Pro Ser He Leu 
260 265 270 



35 Gly Asn Leu Thr Tyr Thr Glu Lys Leu Tyr He Gin Gly Asn Lys Leu 
275 280 285 



Thr Gly Ser He Pro Pro Glu Leu Gly Asn Met Ser Thr Leu His Tyr 
40 290 295 300 



Leu Glu Leu Asn Asp Asn Gin Leu Thr Gly Ser He Pro Pro Glu Leu 
305 310 315 320 

45 

Gly Arg Leu Thr Gly Leu Phe Asp Leu Asn Leu Ala Asn Asn His Leu 
325 330 335 

50 

Glu Gly Pro He Pro Asp Asn Leu Ser Ser Cys Val Asn Leu Asn Ser 
340 345 350 



55 Phe Asn Ala Tyr Gly Asn Lys Leu Asn Gly Thr He Pro Arg Ser Leu 
355 360 365 



30 



Arg Lys Leu Glu Ser Met Thr Tyr Leu Asn Leu Ser' Ser Asn Phe lie 
370 375 380 

5 

Ser Gly Ser lie Pro lie Glu Leu Ser Arg lie Asn Asn Leu Asp Thr 
385 390 395 400 



10 Leu Asp Leu Ser Cys TVsn Met Met Thr Gly Pro lie Pro Ser Ser lie 

405 410 415 



Gly Ser Leu Glu His Leu Leu Arg Leu Asn Leu Ser Lys Asn Gly Leu 
15 420 425 430 



Val Gly Phe He Pro Ala Glu Phe Gly Asn Leu Arg Ser Val Met Glu 
435 440 445 

20 

He Asp Leu Ser Tyr Asn His Leu Gly Gly Leu He Pro Gin Glu Leu 
450 455 460 

25 

Glu Met Leu Gin TVsn Leu Met Leu Leu Asn Val Ser Tyr Asn TVsn Leu 
465 470 475 480 

30 Ala Gly Val Val Pro Ala Asp Asn Asn Phe Thr 7\xg Phe Ser Pro Asp 

485 490 495 



Ser Phe Leu Gly Asn Pro Gly Leu Cys Gly Tyr Trp Leu Gly Ser Ser 
35 500 505 510 



Cys Arg Ser Thr Gly His His Glu Lys Pro Pro He Ser Lys Ala Ala 
515 520 525 

40 

He He Gly Val Ala Val Gly Gly Leu Val He Leu Leu Met He Leu 
530 535 540 

45 

Val Ala Val Cys Arg Pro His Arg Pro Pro Ala Phe Lys Asp Val Thr 
545 550 555 560 

50 Val Ser Lys Pro Val Arg Asn Ala Pro Pro Lys Leu Val He Leu His 

565 570 575 



Met Asn Met Ala Leu His Val Tyr Asp Asp He Met Arg Met Thr Glu 
55 580 585 590 
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Asn Leu Ser Glu Lys Tyr lie He Gly Tyr Gly Ala Ser Ser Thr Val 
595 600 605 



5 Tyr Lys Cys Val Leu Lys Asn Cys Lys Pro Val Ala He Lys Lys Leu 
610 615 620 



Tyr Ala His Tyr Pro Gin Ser Leu Lys Glu Phe Glu Thr Glu Leu Glu 
10 625 630 635 640 

Thr Val Gly Ser He Lys His Arg Asn Leu Val Ser Leu Gin Gly Tyr 
645 650 655 

15 

Ser Leu Ser Pro Val Gly Asn Leu Leu Phe Tyr Asp Tyr Met Glu Cys 
660 665 670 

20 

Gly Ser Leu Trp Asp Val Leu His Glu Gly Ser Ser Lys Lys Lys Lys 
675 680 685 



25 Leu Asp Trp Glu Thr Arg Leu Arg He Ala Leu Gly Ala Ala Gin Gly 
690 695 700 



Leu Ala Tyr Leu His His Asp Cys Ser Pro Arg He He His Arg Asp 
30 705 710 715 720 



Val Lys Ser Lys Asn He Leu Leu Asp Lys Asp Tyr Glu Ala His Leu 
725 730 735 

35 

Thr Asp Phe Gly He Ala Lys Ser Leu Cys Val Ser Lys Thr His Thr 
740 745 750 

40 

Ser Thr Tyr Val Met Gly Thr He Gly Tyr He Asp Pro Glu Tyr Ala 
755 760 765 • 

45 Arg Thr Ser Arg Leu Asn Glu Lys Ser Asp Val Tyr Arg Leu Trp His 
770 775 780 



Cys Ser Ala Gly Ala TULa Asp Trp Gin Glu Ala Ser Gly Gin Arg He 
50 785 790 795 800 



Leu Ser Lys Thr Ala Ser Asn Glu Val Met Asp Thr Val Asp Pro Asp 
805 810 815 

55 



He Gly Asp Thr Cys Lys Asp Leu Gly Glu Val Lys Lys Leu Phe Gin 
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820 825 830 



Leu Ala Leu Leu Cys Thr Lys Arg Gin Pro Ser Asp Arg Pro Thr Met 
5 835 840 845 



His Glu Val Val Arg Val Leu Asp Cys Leu Val Asn Pro Asp Pro Pro 
850 855 860 

10 

Pro Lys Pro Ser Ala His Gin Leu Pro Gin Pro Ser "Pro Ala Val Pro 
865 870 875 880 

15 

Ser Tyr lie Asn Glu Tyr Val Ser Leu Arg Gly Thr Gly Ala Leu Ser 
885 890 895 



20 Cys Ala Asn Ser Thr Ser Thr Ser Asp Ala Glu Leu Phe Leu Lys Phe 
900 905 910 



Gly Glu Ala lie Ser Gin Asn Met Glu 
25 915 920 



<210> 7 
30 <211> 2799 
<212> DNA 

<213> Arabdopsis thaliana ERECTA homolog 

35 

<220> 
40 <221> CDS 

<222> (1)..(2796) 
<223> 

45 



<400> 7 

atg gcg ata aag get tea ttc age aae gtg gcg aat atg ctt ctt gat 
50 48 

Met Ala lie Lys Ala Ser Phe Ser Asn Val Ala Asn Met Leu Leu Asp 
15 10 15 

tgg gac gat gtt cat aac cac gac ttt tgt tet tgg aga ggt gtc ttc 
55 96 

Trp Asp Asp Val His Asn His Asp Phe Cys Ser Trp Arg Gly Val Phe 

20 25 30 
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tgt gat aac gtt age etc aat gtt gtc tct ctt aat ctg tea aac ctg 
144 

Cys Asp Asn Val Ser Leu Asn Val Val Ser Leu Asn Leu Ser Asn Leu 

5 35 40 45 

aat ctt ggt gga gag ata tea tet gee ctt gga gat ttg atg aat ctg 
192 

Asn Leu Gly Gly Glu lie Ser Ser Ala Leu Gly TVsp Leu Met Asn Leu 

10 50 55 60 

caa tea ata gac ttg caa gga aat aaa ttg ggt ggt eaa att cca gat 

240 

Gin Ser lie Asp Leu Gin Gly Asn Lys Leu Gly Gly Gin He Pro Asp 
15 65 70 75 80 

gag att gga aac tgt gtt tct ctt get tat gtg gat tte tee ace aat 
2^88 

Glu He Gly Asn Cys Val Ser Leu Ala Tyr Val Asp Phe Ser Thr Asn 
20 85 90 95 

ttg ttg ttt gga gac ata ccg ttt tea ate tct aaa etc aaa cag ctg 
336 

Leu Leu Phe Gly Asp He Pro Phe Ser lie Ser Lys Leu Lys Gin Leu 
25 100 105 110 

gag ttt ctg aac eta aag aat aat cag etc act ggt cca ata cca gca 
384 

Glu Phe Leu Asn Leu Lys Asn Asn Gin Leu Thr Gly Pro He Pro Ala 

30 115 120 125 

ace tta act cag att cca aac ctt aag ace ctt gac etc gca aga aac 
432 

Thr Leu Thr Gin He Pro Asn Leu Lys Thr Leu Asp Leu Ala Arg Asn 
35 130 135 140 

cag ctt act ggt gag ata cca agg tta etc tac tgg aat gaa gtt tta 
480 

Gin Leu Thr Gly Glu He Pro Arg Leu Leu Tyr Trp Asn Glu Val Leu 
40 145 150 155 160 

cag tat etc ggt tta cgt ggg aat atg tta act ggg aca ttg tct cet 
528 

Gin Tyr Leu Gly Leu Arg Gly Asn Met Leu Thr Gly Thr Leu Ser Pro 
45 165 170 175 

gat atg tgt cag ctg acg ggt ctg tgg tac ttt gat gtg aga ggc aac 
576 

Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Gly Asn 
50 180 185 190 

aac ctt act gga act ate cca gag age att ggc aat tgc aca age ttt 
624 

Asn Leu Thr Gly Thr He Pro Glu Ser He Gly Asn Cys Thr Ser Phe 
55 195 200 205 
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gag ate ttg gat gta tct tat aat cag att acc gga gtt ata ccc tac 
672 

Glu lie Leu Asp Val Ser Tyr Asn Gin lie Thr Gly Val lie Pro Tyr 
210 215 220 

aat att ggt ttc etc eaa gta get act ctg tea ett caa gga aae aag 
720 

Asn lie Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Lys 

225 230 235 240 

ttg act gge aga att ccg gaa gtg att ggt ctg atg cag get ctt get 
768 

Leu Thr Gly Arg lie Pro Glu Val He Gly Leu Met Gin Ala Leu Ala 
245 250 255 

gta ttg gat ttg agt gac aat gaa tta act ggg cet att cca cca ata 
816 

Val Leu Asp Leu Ser Asp Asn Glu Leu Thr Gly Pro He Pro Pro He 
260 265 270 

ctt ggg aat ctg tea ttc act gga aaa ctg tat etc cat gge aae aag 
864 

Leu Gly Asn Leu Ser Phe Thr Gly Lys Leu Tyr Leu His Gly Asn Lys 
275 280 285 

etc act gga caa ate cca ccc gag eta gge aat atg tea cga etc age 
912 

Leu Thr Gly Gin He Pro Pro Glu Leu Gly Asn Met Ser Arg Leu Ser 
290 295 300 

tat ttg eaa eta aat gat aat gaa eta gtg gga aag ate cca ect gag 
960 

Tyr Leu Gin Leu Asn Asp Asn Glu Leu Val Gly Lys He Pro Pro Glu 

305 310 315 320 

ctt ggg aag ctg gaa eaa ttg ttc gaa ctg aat ett geg aae aae aat 
1008 

Leu Gly Lys Leu Glu Gin Leu Phe Glu Leu Asn Leu Ala Asn Asn Asn 
325 330 335 

ctt gta ggg ctg att cca tct aae att agt tec tgt get gee ttg aat 
1056 

Leu Val Gly Leu He Pro Ser Asn He Ser Ser Cys Ala Ala Leu Asn 
340 345 350 

caa ttc aat gtt cat ggg aae ttc ttg agt gga get gta cca ett gaa 
1104 

Gin Phe Asn Val His Gly Asn Phe Leu Ser Gly Ala Val Pro Leu Glu 

355 360 365 

ttc egg aat ett gga age ttg act tat eta aat ctt tec tea aae agt 
1152 

Phe Arg Asn Leu Gly Ser Leu Thr Tyr Leu Asn Leu Ser Ser Asn Ser 
370 375 380 

ttc aag gge aaa ata ect get gag ett gge cat ate ate aat ett gat 
1200 
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Phe Lys Gly Lys lie Pro Ala Glu Leu Gly His lie lie Asn Leu Asp 
385 390 395 400 

aca ttg gat ctg tct ggc aac aat ttc tea ggc tea att cca tta aca 
5 1248 

Thr Leu Asp Leu Ser Gly Asn Asn Phe Ser Gly Ser lie Pro Leu Thr 
405 410 415 

ctt ggt gat ctt gag cat ctt etc ate tta aac ttg age aga aat cat 
10 1296 

Leu Gly Asp Leu Glu His Leu Leu lie Leu Asn Leu Ser Arg Asn His 

420 425 430 

ctg aat ggc aca ttg cct gea gaa ttc ggg aac etc cga age att cag 
15 1344 

Leu Asn Gly Thr Leu Pro Ala Glu Phe Gly Asn Leu Arg Ser lie Gin 

435 440 445 

ate ate gat gtg tea ttt aat ttt ctt gee ggt gtt att cca act gaa 
20 1392 

lie lie Asp Val Ser Phe Asn Phe Leu Ala Gly Val lie Pro Thr Glu 
450 455 460 

ctt ggc cag ttg cag aac ata aac tct ctg ata ctg aac aac aac aag 
25 1440 

Leu Gly Gin Leu Gin Asn lie Asn Ser Leu lie Leu Asn Asn Asn Lys 

465 470 475 480 

att cat ggg aaa ate cct gat cag eta act aac tgc ttc agt ctt gee 
30 1488 

lie His Gly Lys lie Pro Asp Gin Leu Thr Asn Cys Phe Ser Leu Ala 

485 490 495 

aat ctg aac ate tec ttc aat aat ctt tct gga ata ate cca cct atg 
35 1536 

Asn Leu Asn lie Ser Phe Asn Asn Leu Ser Gly lie lie Pro Pro Met 
500 505 510 

aag aac ttt aca cgt ttt tee ecg gee age ttc ttt gga aat cca ttt 
40 1584 

Lys Asn Phe Thr Arg Phe Ser Pro Ala Ser Phe Phe Gly Asn Pro Phe 
515 520 525 

etc tgc ggg aac tgg gtt gga tea ate tgt ggc cca tct tta cct aag 
45 1632 

Leu Cys Gly Asn Trp Val Gly Ser lie Cys Gly Pro Ser Leu Pro Lys 
530 535 540 

tea caa gta ttc acc aga gtt gcc gtg att tgt atg gtt etc ggt ttc 
50 1680 

Ser Gin Val Phe Thr Arg Val Ala Val lie Cys Met Val Leu Gly Phe 

545 550 555 560 

ate act etc ata tgc atg ata ttc att geg gtt tac aag tea aag cag 
55 1728 

He Thr Leu He Cys Met He Phe He Ala Val Tyr Lys Ser Lys Gin 
565 570 575 
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cag aaa cca gtc ttg aaa ggc tct tea aaa caa cct gaa ggg tea acg 
1776 

Gin Lys Pro Val Leu Lys Gly Ser Ser Lys Gin Pro Glu Gly Ser Thr 
5 580 585 590 

aag ctg gtg att ctt cac atg gac atg get att eae acg ttt gat gat 
1824 

Lys Leu Val lie Leu His Met Asp Met Ala lie His Thr Phe Asp Asp 

10 595 600 605 

ate atg aga gtt aca gaa aac ete gat gag aaa tae ate att gga tac 
1872 

lie Met Arg Val Thr Glu Asn Leu Asp Glu Lys Tyr lie lie Gly Tyr 

15 610 615 620 

ggt get tct age aca gtt tac aag tgc ace tec aaa act tec cga ect 
1920 

Gly Ala Ser Ser Thr Val Tyr Lys Cys Thr Ser Lys Thr Ser Arg Pro 

20 625 630 635 640 

att gee att aag cga ate tae aat eag tat eee age aac tte ege gag 
1968 

lie Ala lie Lys Arg lie Tyr Asn Gin Tyr Pro Ser Asn Phe Arg Glu 
25 645 650 655 

ttt gaa aca gag ete gag ace att ggg age ate aga eae aga aac ata 
2016 

Phe Glu Thr Glu Leu Glu Thr lie Gly Ser lie Arg His> Arg Asn lie 

30 660 665 670 

gta age ttg cac gga tac gee tta tct eee ttt ggc aac etc etc tte 
2064 

Val Ser Leu His Gly Tyr Ala Leu Ser Pro Phe Gly Asn Leu Leu Phe 
35 675 680 685 

tac gac tac atg gaa aat ggc tct ctt tgg gat ctt etc eat ggg cct 
2112 

Tyr Asp Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Pro 
40 690 695 700 

ggg aag aag gtg aag ctt gac tgg gaa aca agg ctg aag ata get gtt 
2160 

Gly Lys Lys Val Lys Leu Asp Trp Glu Thr Arg Leu Lys lie Ala Val 

45 705 710 715 720 

gga get gcg caa gga ctt gea tat ctt cac cat gac tgc aca cct agg 
2208 

Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Thr Pro Arg 

50 725 730 735 

ata ate cat cga gac ate aag tea tea aac ata etc ctt gat ggg aat 
2256 

lie He His Arg Asp He Lys Ser Ser Asn He Leu' Leu Asp Gly Asn 

55 740 745 750 
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ttc gaa gcg cgt ttg tea gat ttt ggg att gcc aag age ata cca gcc 
2304 

Phe Glu Ala Arg Leu Ser Asp Phe Gly lie Ala Lys Ser lie Pro Ala 
755 760 765 

5 

acc aaa act tat get tea ace tat gtt ctt gga aec att gga tat att 
2352 

Thr Lys Thr Tyr Ala Ser Thr Tyr Val Leu Gly Thr lie Gly Tyr lie 
770 775 780 

10 

gac cea gag tat get ega aet teg cgt ctg aac gag aag tct gat ate 
2400 

Asp Pro Glu Tyr Ala Arg Thr Ser Arg Leu TVsn Glu Lys Ser Asp lie 
785 790 795 800 

15 

tac agt ttc ggt att gtc ctt ctt gag ctt eta acc ggc aag aag get 
2448 

Tyr Ser Phe Gly lie Val Leu Leu Glu Leu Leu Thr Gly Lys Lys Ala 
805 810 815 

20 

gtg gat aac gag gcc aac ttg cat caa atg att eta tea aag gcg gat 
2496 

Val Asp Asn Glu Ala Asn Leu His Gin Met lie Leu Ser Iiys Ala Asp 
820 825 830 

25 

gat aac aea gta atg gaa get gtt gat gea- gag gtc tea gtg act tgc 
2544 

Asp Asn Thr Val Met Glu Ala Val Asp Ala Glu Val Ser Val Thr Cys 
835 840 845 

30 

atg gac tea gga eae ate aag aaa aea ttt eag eta get etc ttg tgc 
2592 

Met Asp Ser Gly His He Lys Lys Thr Phe Gin Leu Ala Leu Leu Cys 
850 855 860 

35 

acc aag ega aat cct ttg gag aga cee acc atg cag gag gtc tct agg 
2640 

Thr Lys Arg Asn Pro Leu Glu Arg Pro Thr Met Gin Glu Val Ser Arg 
865 870 875 880 

40 

gtt ctg etc tea ctt gtc ccg tct cca cct cea aag aag tta ccg teg 
2688 

Val Leu Leu Ser Leu Val Pro Ser Pro Pro Pro Lys Lys Leu Pro Ser 
885 890 895 

45 

cct gea aaa gta cag gaa ggg gaa gaa egg cgt gag age cac tct tea 
2736 

Pro Ala Lys Val Gin Glu Gly Glu Glu Arg Arg Glu Ser His Ser Ser 
900 905 910 

50 

gat aea aea acc cca cag tgg ttt gtt cag ttc cgt gaa gat ate tec 
2784 

Asp Thr Thr Thr Pro Gin Trp Phe Val Gin Phe Arg Glu Asp He Ser 
915 920 925 

55 

aaa agt age tta taa 
2799 
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Lys Ser Ser Leu 
930 



5 <210> 8 
<211> 932 
<212> PRT 



10 

<213> Arabdopsis thaliana ERECTA homolog 



15 <400> 8 

Met Ala He Lys Ala Ser Phe Ser Asn Val Ala Asn Met Leu Leu Asp 
15 10 15 

20 

Trp Asp Asp Val His Asn His Asp Phe Cys Ser Trp Arg Gly Val Phe 
20 25 30 



25 Cys Asp Asn Val Ser Leu Asn Val Val Ser Leu Asn Leu Ser Asn Leu 
35 40 45 



Asn Leu Gly Gly Glu He Ser Ser Ala Leu Gly Asp Leu Met Asn Leu 
30 50 55 60 



Gin Ser He Asp Leu Gin Gly Asn Lys Leu Gly Gly Gin He Pro Asp 
65 70 75 80 

35 

Glu He Gly Asn Cys Val Ser Leu Ala Tyr Val Asp Phe Ser Thr Asn 
85 90 95 

40 

Leu Leu Phe Gly Asp He Pro Phe Ser He Ser Lys Leu Lys Gin Leu 
100 105 110 



45 Glu Phe Leu Asn Leu Lys Asn Asn Gin Leu Thr Gly Pro lie Pro Ala 
115 120 125 



Thr Leu Thr Gin He Pro Asn Leu Lys Thr Leu Asp Leu Ala Arg Asn 
50 130 135 140 



Gin Leu Thr Gly Glu He Pro Arg Leu Leu Tyr Trp Asn Glu Val Leu 
145 150 155 160 

55 



Gin Tyr Leu Gly Leu Arg Gly Asn Met Leu Thr Gly Thr Leu Ser Pro 
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165 170 175 



Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Gly Asn 
5 180 185 190 



Asn Leu Thr Gly Thr lie Pro Glu Ser He Gly Asn Cys Thr Ser Phe 
195 200 205 

10 

Glu He Leu Asp Val Ser Tyr Asn Gin He Thr Gly Val He Pro Tyr 
210 215 220 

15 

Asn He Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn Lys 
225 230 235 240 



20 Leu Thr Gly Arg He Pro Glu Val He Gly Leu Met Gin Ala Leu Ala 

245 250 255 



Val Leu Asp Leu Ser Asp Asn Glu Leu Thr Gly Pro He Pro Pro He 
25 260 265 270 



Leu Gly Asn Leu Ser Phe Thr Gly Lys Leu Tyr Leu His Gly Asn Lys 
275 280 285 

30 

Leu Thr Gly Gin He Pro Pro Glu Leu Gly Asn Met Ser Arg Leu Ser 
290 295 300 

35 

Tyr Leu Gin Leu Asn Asp Asn Glu Leu Val Gly Lys He Pro Pro Glu 
305 310 315 320 

40 Leu Gly Lys Leu Glu Gin Leu Phe Glu Leu Asn Leu Ala Asn Asn Asn 

325 330 335 



Leu Val Gly Leu He Pro Ser Asn He Ser Ser Cys Ala Ala Leu Asn 
45 340 345 350 



Gin Phe Asn Val His Gly Asn Phe Leu Ser Gly Ala Val Pro Leu Glu 
355 360 365 

50 

Phe Arg Asn Leu Gly Ser Leu Thr Tyr Leu Asn Leu Ser Ser Asn Ser 
370 375 380 

55 

Phe Lys Gly Lys He Pro Ala Glu Leu Gly His He He Asn Leu Asp 
385 390 395 400 
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Thr Leu TVsp Leu Ser Gly Asn Asn Phe Ser Gly Ser lie Pro Leu Thr 
405 410 415 

5 

Leu Gly Asp Leu Glu His Leu Leu lie Leu Asn Leu Ser Arg Asn His 
420 425 430 

10 

Leu Asn Gly Thr Leu Pro Ala Glu Phe Gly Asn Leu Arg Ser lie Gin 
435 440 445 



15 lie lie Asp Val Ser Phe Asn Phe Leu Ala Gly Val lie Pro Thr Glu 
450 455 460 



Leu Gly Gin Leu Gin Asn lie Asn Ser Leu lie Leu Asn Asn Asn Lys 
20 465 470 475 480 



lie His Gly Lys lie Pro Asp Gin Leu Thr Asn Cys Phe Ser Leu Ala 
485 490 495 

25 

Asn Leu Asn He Ser Phe Asn Asn Leu Ser Gly He He Pro Pro Met 
500 505 510 

30 

Lys Asn Phe Thr Arg Phe Ser Pro Ala Ser Phe Phe Gly Asn Pro Phe 
515 520 525 

35 Leu Cys Gly Asn Trp Val Gly Ser He Cys Gly Pro Ser Leu Pro Lys 
530 535 540 

Ser Gin Val Phe Thr Arg Val Ala Val He Cys Met Val Leu Gly Phe 
40 545 550 555 560 

He Thr Leu He Cys Met He Phe He Ala Val Tyr Lys Ser Lys Gin 
565 570 575 

45 

Gin Lys Pro Val Leu Lys Gly Ser Ser Lys Gin Pro Glu Gly Ser Thr 
580 585 590 

50 

Lys Leu Val He Leu His Met Asp Met Ala He His Thr Phe Asp Asp 
595 600 605 



55 He Met Arg Val Thr Glu Asn Leu Asp Glu Lys Tyr He He Gly Tyr 
610 615 620 
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Gly J\la Ser Ser Thr Val Tyr Lys Cys Thr Ser Lys Thr Ser Arg Pro 
625 630 635 640 

5 

lie Ala lie Lys Arg lie Tyr Asn Gin Tyr Pro Ser Asn Phe Arg Glu 
645 650 655 



10 Phe Glu Thr Glu Leu Glu Thr lie Gly Ser He Arg His Arg Asn He 
660 . 665 670 



Val Ser Leu His Gly Tyr Ala Leu Ser Pro Phe Gly Asn Leu Leu Phe 
15 675 680 685 



Tyr Asp Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Pro 
690 695 700 

20 

Gly Lys Lys Val Lys Leu Asp Trp Glu Thr Arg Leu Lys lie Ala Val 
705 710 715 ' 720 

25 

Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Thr Pro Arg 
725 730 735 



30 He He His Arg Asp He Lys Ser Ser Asn He Leu Leu Asp Gly Asn 
740 745 750 



Phe Glu Ala Arg Leu Ser. Asp Phe Gly He Ala Lys Ser He Pro Ala 
35 755 760 765 



Thr Lys Thr Tyr Ala Ser Thr Tyr Val Leu Gly Thr He Gly Tyr He 
770 775 780 

40 

Asp Pro Glu Tyr Ala Arg Thr Ser Arg Leu Asn Glu Lys Ser Asp He 
785 790 795 800 

45 

Tyr Ser Phe Gly He Val Leu Leu Glu Leu Leu Thr Gly Lys Lys Ala 
805 810 815 

50 Val Asp Asn Glu Ala Asn Leu His Gin Met He Leu Ser Lys Ala TVsp 
820 825 830 



Asp Asn Thr Val Met Glu Ala Val Asp Ala Glu Val Ser Val Thr Cys 
55 835 840 845 
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Met Asp Ser Gly His lie Lys hys Thr Phe Gin Leu Ala lieu Leu Cys 
850 855 860 



5 Thr Lys Arg Asn Pro Leu Glu Arg Pro Thr Met Gin Glu Val Ser Arg 
865 870 875 880 



Val Leu Leu Ser Leu Val Pro Ser Pro Pro Pro Lys Lys Leu Pro Ser 
10 885 890 895 



Pro Ala Lys Val Gin Glu Gly Glu Glu Arg Arg Glu Ser His Ser Ser 
900 905 910 

15 

Asp Thr Thr Thr Pro Gin Trp Phe Val Gin Phe Arg Glu Asp lie Ser 
915 920 925 

20 

Lys Ser Ser Leu 
930 



25 <210> 9 

<211> 2901 

<212> DNA 
30 \ 

<213> Aradopsis thaliana ERECTA homolog 



35 <220> 

<221> CDS 

<222> (1) . • (2898) 

40 

<223> 



45 <400> 9 

atg aag gag aag atg cag cga atg gtt tta tct tta gca atg gtg ggt 
48 

Met Lys Glu Lys Met Gin Arg Met Val Leu Ser Leu Ala Met Val Gly 
15 10 15 

50 

ttt atg gtt ttt ggt gtt get teg get atg aae aae gaa ggg aaa get 
96 

Phe Met Val Phe Gly Val Ala Ser Ala Met Asn Asn Glu Gly Lys Ala 
20 25 30 

55 

ctg atg geg ata aaa gge tct ttc age aae tta gtg aat atg ett ttg 
144 
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Leu Met Ala lie Lys Gly Ser Phe Ser Asn Leu Val Asn Met Leu Leu 
35 40 45 

gat tgg gac gat gtt cac aac agt gac ttg tgt tct tgg cga ggt gtt 
5 192 

Asp Trp Asp Asp Val His Asn Ser Asp Leu Cys Ser Trp Arg Gly Val 
50 55 60 

ttc tgc gac aac gtt age tac tec gtt gtc tct ctg aat ttg tec agt 
10 240 

Phe Cys Asp Asn Val Ser Tyr Ser Val Val Ser Leu Asn Leu Ser Ser 

65 70 75 80 

ctg aat ctt gga ggg gag ata tct cca get att gga gac eta egg aat 
15 288 

Leu Asn Leu Gly Gly Glu lie Ser Pro Ala lie Gly Asp Leu Arg Asn 
85 90 95 

ttg caa tea ata gac ttg caa ggt aat aaa eta gea ggt eaa att cca 
20 336 

Leu Gin Ser lie Asp Leu Gin Gly Asn Lys Leu Ala Gly Gin lie Pro 
100 105 110 

gat gag att gga aac tgt get tct ett gtt tat ctg gat ttg tee gag 
25 384 

Asp Glu lie Gly Asn Cys Ala Ser Leu Val Tyr Leu Asp Leu Ser Glu 
115 120 125 

aat ctg tta tat gga gac ata cet ttc tea ate tct aaa etc aag eag 
30 432 

Asn Leu Leu Tyr Gly TVsp lie Pro Phe Ser lie Ser Lys Leu Lys Gin 
130 135 140 

ctt gaa act ctg aat ctg aag aac aat eag etc aca ggt cet gta cca 
35 480 

Leu Glu Thr Leu Asn Leu Lys Asn Asn Gin Leu Thr Gly Pro Val Pro 
145 150 155 160 

gea ace tta ace cag att cca aac ett aag aga ett gat ett get gge 
40 528 

Ala Thr Leu Thr Gin lie Pro Asn Leu Lys 7\xg Leu Asp Leu Ala Gly 
165 170 175 

aat cat eta acg ggt gag ata teg aga ttg ett tac tgg aat gaa gtt 
45 576 

Asn His Leu Thr Gly Glu lie Ser Arg Leu Leu Tyr Trp Asn Glu Val 
180 185 190 

ttg cag tat ett gga tta cga ggg aat atg ttg act gga acg tta tct 
50 624 

Leu Gin Tyr Leu Gly Leu Arg Gly Asn Met Leu Thr Gly Thr Leu Ser 
195 200 205 

tct gat atg tgt eag eta ace ggt ttg tgg tac ttt gat gtg aga gga 
55 672 

Ser Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Gly 
210 215 220 
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aat aat eta act gga acc ate ccg gag age ate gga aat tge aca age 
720 

Asn TVsn Leu Thr Gly Thr lie Pro Glu Ser He Gly Asn Cys Thr Ser 

5 225 230 235 240 

ttt caa ate ctg gae ata tct tat aat eag ata aca gga gag att cet 
768 

Phe Gin He lieu Asp He Ser Tyr Asn Gin He Thr Gly Glu He Pro 

10 245 250 255 

tac aat ate ggc ttc etc caa gtt get act ctg tea ctt caa gga aae 
816 

Tyr Asn He Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn 
15 260 265 270 

aga ttg acg ggt aga att cca gaa gtt att ggt eta atg cag get ctt 
864 

Arg Leu Thr Gly Arg He Pro Glu Val He Gly Leu Met Gin Ala Leu 
20 275 280 285 

get gtt ttg gat ttg agt gae aat gag ctt gtt ggt cct ate cca ccg 
912 

Ala Val Leu Asp Leu Ser Asp Asn Glu Leu Val Gly Pro He Pro Pro 
25 290 295 300 

ata ctt ggc aat etc tea ttt ace gga aag ttg tat etc cat ggc aat 

960 

He Leu Gly Asn Leu Ser Phe Thr Gly Lys Leu Tyr Leu His Gly Asn 

30 305 310 315 320 

atg etc act ggt cca ate ccc tct gag ctt ggg aat atg tea cgt etc 
1008 

Met Leu Thr Gly Pro He Pro Ser Glu Leu Gly Asn Met Ser Arg Leu 

35 325 330 335 

age tat ttg cag eta aac gae aat. aaa eta gtg gga act att cca cct 
1056 

Ser Tyr Leu Gin Leu Asn Asp Asn Lys Leu Val Gly Thr He Pro Pro 

40 340 345 350 

gag ctt gga aag ctg gag caa ttg ttt gaa ctg aat ctt gee aac aae 
1104 

Glu Leu Gly Lys Leu Glu Gin Leu Phe Glu Leu Asn Leu Ala Asn Asn 
45 355 360 365 

cgt tta gta ggg ccc ata cca tec aac att agt tea tgt gba gee ttg 
1152 

Arg Leu Val Gly Pro He Pro Ser TVsn He Ser Ser Cys Ala Ala Leu 
50 370 375 380 

aat caa ttc aat gtt cat ggg aac etc ttg agt gga tct att cca ctg 
1200 

Asn Gin Phe Asn Val His Gly Asn Leu Leu Ser Gly Ser He Pro Leu 

55 385 390 395 400 



gcg ttt cgc aat 
1248 

Ala Phe Arg Asn 



aat ttc aag gga 
1296 

Asn Phe Lys Gly 
420 

10 

gac aaa eta gat 
1344 

Asp Lys Leu Asp 
435 

15 

acg ctt ggc gat 
1392 

Thr Leu Gly Asp 
450 

20 

cat ctt agt gga 
1440 

His Leu Ser Gly 
465 

25 

cag atg att gat 
1488 

Gin Met lie Asp 

30 

gaa ctt ggc caa 
1536 

Glu Leu Gly Gin 
500 

35 

aag ctt cat ggg 
1584 

Lys Leu His Gly 
515 

40 

gtc aat ctg aat 
1632 

Val Asn Leu Asn 
530 

45 

atg aaa aac ttc 
1680 

Met Lys Asn Phe 
545 

50 

tat ctt tgt gga 
1728 

Tyr Leu Cys Gly 

55 

tct cga gta ttc 
1776 




etc ggg age ttg 

Leu Gly Ser Leu 
405 

aaa ata eca gtt 
Lys He Pro Val 

ctg tct ggc aat 

Leu Ser Gly Asn 
440 

ctt gaa cac ctt 

Leu Glu His Leu 
455 

caa tta cct gca 

Gin Leu Pro Ala 
470 

gta tea ttc aat 

Val Ser Phe Asn 
485 

ttg cag aat tta 

Leu Gin Asn Leu 

aaa att cca gat 

Lys He Pro Asp 
520 

gtc tec ttc aac 

Val Ser Phe Asn 
535 

tea cgt ttt get 

Ser Arg Phe Ala 
550 

aac tgg gtt gga 

Asn Trp Val Gly 
565 

tec aga ggt get 
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act tat ctg aat 

Thr Tyr Leu Asn 
410 

gag ctt gga cat 

Glu Leu Gly His 
425 

aac ttc tea ggg 
Asn Phe Ser Gly 

etc ata tta aat 

Leu He Leu Asn 
460 

gag ttt ggg aac 

Glu Phe Gly Asn 
475 

ctg etc tee gga 

Leu Leu Ser Gly 
490 

aac tct tta ata 

Asn Ser Leu He 
505 

cag ctt acg aac 
Gin Leu Thr Asn 

aat- etc tec ggg 

Asn Leu Ser Gly 
540 

cca gee age ttt 

Pro Ala Ser Phe 
555 

tct att tgt ggt 

Ser He Cys Gly 
570 

ttg ate tgc att 




ctt teg teg aac 

Leu Ser Ser Asn 
415 

ata ate aat ctt 

He He Asn Leu 
430 

tct ata cca tta 

Ser He Pro Leu 
445 

ett age aga aac 
Leu Ser Arg Asn 

ett cga age att 

Leu Arg Ser He 
480 

gtt att cca act 

Val He Pro Thr 
495 

ttg aac aac aac 

Leu Asn Asn Asn 
510 

tgc ttc act ctt 

Cys Phe Thr Leu 
525 

ata gtc eca cca 

He Val Pro Pro 

gtt gga aat cca 

Val Gly Asn Pro 
560 

cct tta ccg aaa 

Pro Leu Pro Lys 
575 

gtt ctt ggc gtc 
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Ser Arg Val Phe Ser Arg Gly Ala Leu lie Cys lie Val Leu Gly Val 
580 585 590 

ate act etc eta tgt atg att ttc ctt gea gtt tac aaa tea atg eag 
5 1824 

lie Thr Leu Leu Cys Met lie Phe Leu Ala Val Tyr Lys Ser Met Gin 
595 600 605 

cag aag aag att eta caa ggc tec tea aaa caa get gaa ggg tta ace 
10 18-72 

Gin Lys Lys lie Leu Gin Gly Ser Ser Lys Gin Ala Glu Gly Leu Thr 
610 615 620 

aag eta gtg att etc cac atg gae atg gea att cat aca ttt gat gat 
15 1920 

Lys Leu Val He Leu His Met Asp Met Ala He His Thr Phe Asp Asp 

625 630 635 640 

ate atg aga gtg act gag aat ctt aac gaa aag ttt ata att gga tat 
20 1968 

He Met Arg Val Thr Glu Ash Leu Asn Glu Lys Phe He He Gly Tyr 
645 650 655 

ggt get tct age aeg gta tac aaa tgt gea tta aaa agt tec cga cct 
25 2016 

Gly Ala Ser Ser Thr Val Tyr Lys Cys Ala Leu Lys Ser Ser Arg Pro 

660 665 670 

att gee att aag cga etc tac aat cag tat ccg cat aac ttg egg gaa 
30 2064 

He Ala He Lys Arg Leu Tyr Asn Gin Tyr Pro His Asn Leu Arg Glu 
675 680 685 

ttt gag aca gaa ctt gag ace att ggg age att agg cac aga aac ata 
35 2112 

Phe Glu Thr Glu Leu Glu Thr He Gly Ser He Arg His Arg Asn He 
690 695 700 

gte age ttg cat gga tat gee ttg tct cct act ggc aac ctt ctt ttc 
40 2160 

Val Ser Leu His Gly Tyr Ala Leu Ser Pro Thr Gly Asn Leu L^u Phe 

705 710 715 720 

tat gae tac atg gaa aat gga tea ctt tgg gae ctt ctt eat ggg tea 
45 2208 

Tyr Asp Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Ser 
725 730 735 

ttg aag aaa gtg aag ctt gat tgg gag aca agg ttg aag ata geg gtt 
50 2256 

Leu Lys Lys Val Lys Leu Asp Trp Glu Thr Arg Leu Lys He Ala Val 
740 745 750 

gga get gea caa gga eta gee tat ctt cac cac gat tgt act cct cga 
55 2304 

Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His TVsp Cys Thr Pro Arg 
755 760 765 



# • 
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ate att cac cgt gac ate aag tea teg aae ata ett ctt gat gag aat 
2352 

lie lie His Arg Asp lie Lys Ser Ser Asn lie Leu Leu Asp Glu Asn 
5 770 775 780 

ttc gaa gea eat tta tet gat tte ggg att get aag age ata eea get 
2400 

Phe Glu Ala His Leu Ser Asp Phe Gly lie Ala Lys Ser lie Pro Ala 

10 785 790 795 800 

age aaa ace eat gee teg aet tat gtt ttg gga aea att ggt tat ata 
2448 

Ser Lys Thr His Ala Ser Thr Tyr Val Leu Gly Thr lie Gly Tyr lie 
15 805 810 815 

gac eea gag tat get egt aet tea ega ate aat gag aaa tee gat ata 
2496 

Asp Pro Glu Tyr Ala Arg Thr Ser Arg lie Asn Glu Lys Ser Asp lie 
20 820 825 830 

tac age tte ggt att gtt ett ett gag ett ete aet ggg aag aaa gea 
2544 

Tyr Ser Phe Gly lie Val Leu Leu Glu Leu Leu Thr Gly Lys Lys Ala 

25 835 840 845 

gtg gat aae gaa get aac ttg eat eaa etg ata ttg tea aag get gat 
2592 

Val Asp Asn Glu Ala Asn Leu His Gin Leu lie Leu Ser Lys Ala Asp 

30 850 855 860 

gat aat aet gtg atg gaa gea gtt gat eea gag gtt act gtg aet tgt 
2640 

Asp Asn Thr Val Met Glu Ala Val Asp Pro Glu Val Thr Val Thr Cys 

35 865 870 875 880 

atg gae ttg gga eat ate agg aag aea ttt eag etg get etc tta tgc 
2688 

Met Asp Leu Gly His He Arg Lys Thr Phe Gin Leu Ala Leu Leu Cys 
40 885 890 895 

aea aag ega aae ect tta gag aga cec aea atg ett gaa gte tet agg 
2736 

Thr Lys Arg Asn Pro Leu Glu Arg Pro Thr Met Leu Glu Val Ser Arg 
45 900 905 910 

gtt etg etc tet ett gte eea tet etg eaa gta gea aag aag eta ect 
2784 

Val Leu Leu Ser Leu Val Pro Ser Leu Gin Val Ala Lys Lys Leu Pro 

50 915 920 925 

tet ctt gat eac tea aec aaa aag etg cag eaa gag aat gaa gtt agg 
2832 

Ser Leu Asp His Ser Thr Lys Lys Leu Gin Gin Glu Asn Glu Val Arg 

55 930 935 940 
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aat cct gat gca gaa gca tct caa tgg ttt gtt cag ttc cgt gaa gtc 
2880 

Asn Pro Asp Ala Glu Ala Ser Gin Trp Phe Val Gin Phe Arg Glu Val 
945 950 955 960 

5 

ate tec aaa agt age ata taa 
2901 

lie Ser Lys Ser Ser lie 
965 

10 

<210> 10 
<211> 966 

15 

<212> PRT 

<213> Aradopsis thaliana ERECTA homolog 

20 

<400> 10 

Met Lys Glu Lys Met Gin Arg Met Val Leu Ser Leu Ala M.et Val Gly 
25 1 5 10 15 



Phe Met Val Phe Gly Val Ala Ser Ala Met Asn Asn Glu Gly Lys Ala 
20 25 30 

30 

Leu Met Ala lie Lys Gly Ser Phe Ser Asn Leu Val TVsn Met Leu Leu 
35 40 45 

35 

Asp Trp Asp Asp Val His Asn Ser Asp Leu Cys Ser Trp Arg Gly Val 
50 55 60 

40 Phe Cys Asp Asn Val Ser Tyr Ser Val Val Ser Leu Asn Leu Ser Ser 
65 70 75 80 

Leu Asn Leu Gly Gly Glu lie Ser Pro Ala He Gly Asp Leu Arg Asn 
45 85 90 95 



Leu Gin Ser He Asp Leu Gin Gly Asn Lys Leu Ala Gly Gin He Pro 
100 105 110 

50 

Asp Glu He Gly Asn Cys Ala Ser Leu Val Tyr Leu Asp Leu Ser Glu 
115 120 125 

55 

Asn Leu Leu Tyr Gly Asp He Pro Phe Ser He Ser Lys Leu Lys Gin 
130 135 140 



49 



Leu Glu Thr Leu Asn Leu Lys Asn Asn Gin Leu Thr Gly Pro Val Pro 
145 150 155 160 

5 

Ala Thr Leu Thr Gin lie Pro Asn Leu Lys Arg Leu Asp Leu Ala Gly 
165 170 175 

10 

Asn His Leu Thr Gly Glu lie Ser Arg Leu Leu Tyr Trp Asn Glu Val 
180 185 190 



15 Leu Gin Tyr Leu Gly Leu TVrg Gly Asn Met Leu Thr Gly Thr Leu Ser 
195 200 205 



Ser Asp Met Cys Gin Leu Thr Gly Leu Trp Tyr Phe Asp Val Arg Gly 
20 210 215 220 



Asn Asn Leu Thr Gly Thr lie Pro Glu Ser lie Gly Asn Cys Thr Ser 
225 230 235 240 

25 

Phe Gin lie Leu Asp He Ser Tyr Asn Gin He Thr Gly Glu He Pro 
245 250 255 

30 

Tyr Asn He Gly Phe Leu Gin Val Ala Thr Leu Ser Leu Gin Gly Asn 
260 265 270 

35 Arg Leu Thr Gly Arg He Pro Glu Val He Gly Leu Met Gin Ala Leu 
275 280 285 



Ala Val Leu Asp Leu Ser Asp Asn Glu Leu Val Gly Pro He Pro Pro 
40 290 295 300 



He Leu Gly TVsn Leu Ser Phe Thr Gly Lys Leu Tyr Leu His Gly Asn 
305 310 315 320 

45 

Met Leu Thr Gly Pro He Pro Ser Glu Leu Gly Asn Met Ser Arg Leu- 
325 330 335 

50 

Ser Tyr Leu Gin Leu Asn Asp Asn Lys Leu Val Gly Thr He Pro Pro 
340 345 350 



55 Glu Leu Gly Lys Leu Glu Gin Leu Phe Glu Leu Asn Leu Ala Asn Asn 
355 360 365 
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Arg Leu Val Gly Pro lie Pro Ser Asn lie Ser Ser Cys Ala Ala Leu 
370 375 380 

5 

Asn Gin Phe Asn Val His Gly Asn Leu Leu Ser Gly Ser lie Pro Leu 
385 390 395 400 

10 Ala Phe Arg Asn Leu Gly Ser Leu Thr Tyr Leu Asn Leu Ser Ser Asn 

405 410 415 



Asn Phe Lys Gly Lys lie Pro Val Glu Leu Gly His lie lie Asn Leu 
15 420 425 430 



Asp Lys Leu Asp Leu Ser Gly Asn Asn Phe Ser Gly Ser lie Pro Leu 
435 440 445 

20 

Thr Leu Gly Asp Leu Glu His Leu Leu lie Leu Asn Leu Ser Arg Asn 
450 455 460 

25 

His Leu Ser Gly Gin Leu Pro Ala Glu Phe Gly Asn Leu Arg Ser He 
465 470 475 480 

30 Gin Met He Asp Val Ser Phe Asn Leu Leu Ser Gly Val He Pro Thr 

485 490 495 

Glu Leu Gly Gin Leu Gin Asn Leu Asn Ser Leu He Leu Asn Asn Asn 
35 500 505 510 

Lys Leu His Gly Lys He Pro Asp Gin Leu Thr Asn Cys Phe Thr Leu 
515 520 525 

40 

Val Asn Leu Asn Val Ser Phe Asn Asn Leu Ser Gly He Val Pro Pro 
530 535 540 

45 

Met Lys Asn Phe Ser Arg Phe Ala Pro Ala Ser Phe Val Gly Asn Pro 
545 550 555 560 

50 Tyr Leu Cys Gly Asn Trp Val Gly Ser He Cys Gly Pro Leu Pro Lys 

565 570 575 

Ser Arg Val Phe Ser Arg Gly Ala Leu He Cys He Val Leu Gly Val 
55 580 585 590 
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lie Thr Leu Leu Cys Met lie Phe Leu Ala Val Tyr Lys Ser Met Gin 
595 600 605 



5 Gin Lys Lys lie Leu Gin Gly Ser Ser Lys Gin Ala Glu Gly Leu Thr 
610 615 620 



Lys Leu Val lie Leu His Met Asp Met Ala lie His Thr Phe Asp Asp 
10 625 630 635 640 



He Met Arg Val Thr Glu Asn Leu Asn Glu Lys Phe He He Gly Tyr 
645 650 655 

15 

Gly Ala Ser Ser Thr Val Tyr Lys Cys Ala Leu Lys Ser Ser Arg Pro 
660 665 670 

20 

He Ala He Lys Arg Leu Tyr Asn Gin Tyr Pro His Asn Leu Arg Glu 
675 680 685 



25 Phe Glu Thr Glu Leu Glu Thr He Gly Ser He Arg His Arg Asn He 
690 695 700 



Val Ser Leu His Gly Tyr Ala Leu Ser Pro Thr Gly Asn Leu Leu Phe 
30 705 710 715 720 



Tyr Asp Tyr Met Glu Asn Gly Ser Leu Trp Asp Leu Leu His Gly Ser 
725 730 735 

35 

Leu Lys Lys Val Lys Leu Asp Trp Glu Thr 7\xg Leu Lys He Ala Val 
740 745 750 

40 

Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Thr Pro Arg 
755 760 765 



45 He He His Arg Asp He Lys Ser Ser Asn He Leu Leu Asp Glu Asn 
770 775 780 



Phe Glu Ala His Leu Ser Asp Phe Gly He Ala Lys Ser He Pro Ala 
50 785 790 795 800 



Ser Lys Thr His Ala Ser Thr Tyr Val Leu Gly Thr He Gly Tyr He 
805 810 815 

55 



Asp Pro Glu Tyr Ala Arg Thr Ser Arg He Asn Glu Lys Ser Asp He 
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820 825 830 



Tyr Ser Phe Gly lie Val Leu Leu Glu Leu Leu Thr Gly Lys Lys Ala 
5 835 840 845 



Val Asp Asn Glu Ala Asn Leu His Gin Leu lie Leu Ser Lys Ala Asp 
850 855 860 

10 

Asp Asn Thr Val Met Glu Ala Val Asp Pro Glu Val Thr Val Thr Cys 
865 870 875 880 

15 

Met Asp Leu Gly His lie Arg Lys Thr Phe Gin Leu Ala Leu Leu Cys 
885 890 895 



20 Thr Lys Arg Asn Pro Leu Glu Arg Pro Thr Met Leu Glu Val Ser Arg 
900 905 910 



Val Leu Leu Ser Leu Val Pro Ser Leu Gin Val Ala Lys Lys Leu Pro 
25 915 920 925 



Ser Leu Asp His Ser Thr Lys Lys Leu Gin Gin Glu Asn Glu Val Arg 
930 935 940 

30 

Asn Pro Asp Ala Glu Ala Ser Gin Trp Phe Val Gin Phe Arg Glu Val 
945 950 955 960 

35 

lie Ser Lys Ser Ser lie 
965 
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Genotype 



Figure la 
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Figure lb 
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Two experiments (a&b) 
seedlings in soil 
no fertliser added 
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Figure .2a 
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Figure 2b 
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Transpiration efficiency from gas exchange measurements 
Run 11- Dec 2001 
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Figure Ic 




Figure 2c 



